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Transduction of recombinases for inducible gene targeting 

The present invention provides the use of a fusion protein comprising a 
site-specific DNA recombinase domain and a protein transduction domain 
for preparing an agent for inducing target gene alteration in a living 
organism or in cultured cells, suitable fusion proteins and a method for the 
production of said fusion proteins. 

Background 

For some years targeted mutagenesis in totipotent mouse embryonic stem 
(ES) cells has been used to inactivate genes, for which cloned sequences 
were available (Capecchi, Trends in Genetics 5, 70 - 76 (1989)). Since ES 
cells can pass mutations induced in vitro to transgenic offspring in vivo, it 
is possible to analyze the consequences of gene disruption in the context 
of the entire organism. Thus, numerous mouse strains with functionally 
inactivated genes ("knock out mice") have been created by this 
technology and utilized to study the biological function of a variety of 
genes. 

A refined method of targeted mutagenesis, referred to as conditional 
mutagenesis, employs a site-specific recombination system (e.g. Cre/loxP 
or Flp/frt - Sauer and Henderson, N. Proc. Natl. Acad. Sci. USA 85, 5166- 
5170 (1988); Senecoff et al., 3. Mol. Biol., 201, 405 - 421 (1988)) which 
enables a temporally . and/or spatially restricted alteration of target genes 
(Rajewsky et al., J. Clin. Invest., 98, 600 - 603 (1996)). The creation of 
conditional mouse mutants requires the generation of two mouse strains, 
i.e. the recombinase recognition strain and the recombinase expressing 
strain. The recombinase recognition strain is generated by homologous 
recombination in ES cells as described above except that the targeted 
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exon(s) is (are) flanked by two recombinase recognition sequences ; ' 
(hereinafter "RRS"; e.g. loxP or frt). The type of recombination event 
mediated by the recombinase depends on the disposition of the RRS, with 
deletions, inversions, translocations and integrations being possible 
(Torres and Kiihn, Oxford University Press, Oxford, New York (1997)). By 
placing the RRS into introns, an interference with gene expression before 
recombination can be avoided. The recombinase expressing strain 
contains a recombinase transgene (e.g. Cre, Flp) whose expression is 
either restricted to certain cells and tissues or is inducible by external 
agents. Crossing of the recombinase recognition strain with the 
recombinase expressing strain recombines the RRS-flanked exons from 
the doubly transgenic offspring in a prespecified temporally and/or 
spatially restricted manner. Thus, the method allows the temporal analysis 
of gene function in particular cells and tissues of otherwise widely 
expressed genes. Moreover, it enables the analysis of gene function in the 
adult organism by circumventing embryonic lethality which is frequently 
the consequence of gene mutation. For pharmaceutical research, aiming 
to validate the utility of genes and their products as targets for drug 
development, inducible mutations provide an excellent genetic tool. 
However, the current systems for inducible recombinase expression in 
transgenic animals suffer from a certain degree of leakiness in the 
absence of the inducer (Kiihn et al., Science 269(5229): 1427-9 (1995); 
Schwenk et al., Nucleic Acids Res.; 26(6): 1427-32 (1998)). Furthermore, 
the generation of conditional mutants is a time consuming and labor 
intensive procedure, since the recombinase recognition strain and the 
recombinase expressing strain have to be breed at least over two 
generations in order to obtain animals carrying both, the recombinase 
transgene and two copies of the RRS-flanked target gene sequence. 

Protein tranduction domains (hereinafter shortly referred to as "PTD") that 
have the ability to cross cell membranes were identified, e.g. in the 
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Antennapedia protein from Drosophila (Vives et al., J. Biol. Chem/-'"' 
272(25):16010-7 (1997)), Kaposi fibroblast growth factor (Kaposi FGF; 
Lin et al., J. Biol. Chem. 270: 14255-58 (1995)), VP22 from HSV (Elliott 
and O'Hare, Cell, 88(2):223-33 (1997)) and TAT from HIV (Green and 
Loewenstein, Cell, 55(6): 1179-88 (1988); Frankel and Pabo, Cell, 
55(6): 1189-93 (1988)). WO 99/29721 moreover mentions TAT mutants 
having an enhanced activity as compared to the wild-type peptide. 
Fusion of PTDs to heterologuous proteins conferred the ability to transduce 
into cultured cells (Fawell et al., Proc. Natl. Acad. Sci. USA, 91(2):664-8 
, -, (1994); Elliott and O'Hare (1997), Phelan et al., Nature Biotech. 16; 440- 
443 (1998) and Dilber et al., Gene Ther., 6(1):12-21 (1999)). Dalby and 
Bennett showed that a fusion protein consisting of VP22 and functional Flp 
recombinase translocated between cells in culture (from COS-1 cells 
transfected with VP22-Flp to CHO cells carrying Flp recognition sites (FRT 
sites); see Dalby and Bennett, Invitrogen, Expressions 6.2, page 13 
(1999)). Further WO 99/11809 mentions a fusion protein Antp-Cre and 
emphasizes that it may be used to deliver the Creinto the cell which 
recombines inside the cell nucleus. It is mentioned that the fusion protein 
is suitable for manipulating genomic DNA at precise locations in a 
temporal regulated manner. 

Furthermore, a recent report demonstrated that the 6-galactosidase 
protein fused to the 11 amino acids PTD from the HIV TAT protein can 
infiltrate all tissues of living mice reaching every single cell (Schwarze et 
al., Science, 285(5433):1569-72 (1999)). Finally, WO 99/60142 discloses 
vector constructs for gene therapy carrying a tumor cell sensitizing gene, 
a sensitizing gene expression regulatory system, a control gene and a 
control gene expression regulatory system, wherein the control gene can 
be a fusion gene consisting of a recombinase (viz. Cre or Flp) and a 
trafficking protein (viz. VP22). 
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With regard to the fusion protein Antp-Cre of WO 99/11809, it is however, -"V 
general knowledge in the art that the Antennapedia PTD is not a generally 
applicable transducing protein, namely it has only a limited activity with 
proteins having more than 100 amino acid residues (Derossi et al., Trends 
Cell Biol. 8: 84-87, 1998). In view of the limited transducing activity of the 
Antp PTD and the size of the generally known recombinases (ranging from 
about 200 to about 600 amino acid residues), it was desirable to provide a 
more potent system for the transduction of recombinases. It was, 
however, not clear for a person skilled in the art whether PTDs would be 
effective at all with recombinases for the following reasons: 

(i) only a single example of PTD-mediated delivery of proteins (above 100 
amino acid residues) in vivo has been reported so far (Schwarze et al., 
Science, 285(5433): 1569-72 (1999); Fawell et al., PNAS, 91: 664-68 
(1994); both references describing the TAT-mediated transduction of 8- 
galactosidase in mice); 

(ii) it is known that - due to defolding and refolding processes - the 
transduction of native proteins into cells may result in a significant loss of 
protein activity (e.g., as described for TAT-GFP; Schwarze et al, Trends 
Cell Biol. 10: 290-95 (2000)); 

(iii) neither the number of protein molecules that can be transferred into a 
cell by a given translocation domain has been systematically determined, 
nor the number of Cre molecules in the cell nucleus that is required for 
efficient recombination; 

(iv) the delivery of active proteins requires unfolding- and proper refolding 
which is unpredictable for a given protein (Bonifaci et al., AIDS 9: 
995-1000 1995); and 

(v) the mechanism by which protein transduction domains facilitate 
protein transduction in unknown and several findings have been published 
that rule out classical receptor-, transporter-, endosome- or endocytosis- 
mediated processes in the transduction of Ant, TAT and VP22 (G. Eliott, P. 
O'Hare, Cell 88, 223-233 (1997); D.A. Mann, A.D. Frankel, EMBO. J. 10, 
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1733-1739 (1991); D. Derossi et al., J. Biol. Chem. 269, 10444-10450 
(1994); D. Derossi et al., J. Biol. Chem. 271, 18188-18193 (1996); E. 
Vives etal., J. Biol. Chem. 272, 16010-16017 (1997)). 

Moreover, there was still the need for a generally applicable method where 
the genetic manipulation can be performed in both, endogenous genes 
and transgenes. 

Summary of the Invention 

It was found that site-specific DNA recombinase proteins can be 
translocated into cells of a living organism when fused to specific protein 
transduction domains, namely transduction domains being derived from 
the VP22 protein of HSV or from the TAT protein of HIV. Thus, whenever a 
gene mutation is desired, recombination is induced upon the injection of 
the appropriate site-specific recombinase fused to a transduction domain 
into such a living organism (provided, however, that said organism carries 
at least one appropriate RRS integrated in the genome). 

The present invention thus provides 

(1) the use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in its genome; 

(2) a method for inducing gene alterations in a living organism which 
comprises administering to said living organism a fusion protein 
comprising a site-specific DNA recombinase domain and a PTD as defined 
in (1) above, wherein said living organism carries at least one or more 
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recognition sites for said site-specific DNA recombinase integrated in its'- -J 
genome; 

(3) a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a PTD being derived from the VP22 protein of HSV or from the TAT 
protein of HIV 

provided that when the site-specific DNA recombinase domain is wild-type 
Cre or Flp then the PTD is not the full length VP22 PTD of HSV (i.e., the 
fusion protein is not identical to the fusion protein of Dalby and Bennett, 
Invitrogen, Expressions 6.2, page 13 (1999) and of WO 99/60142); 

(4) a DNA sequence coding for the fusion protein of (3) above; 

(5) a vector comprising the DNA sequence as defined in (4) above; 

(6) a host cell transformed with the vector of (5) above and/or comprising 
the DNA of (4) above; 

(7) a method for producing the fusion protein of (1) above which 
comprises culturing the transformed host cell of (6) above and isolating 
the fusion protein; and 

(8) an injectable composition comprising the fusion protein as defined in 
(1) or (3) above. 

The invention is further illustrated by the appended Figures and is 
explained in detail below. 

Description of the Figures 

Fia. 1: Generation of induced mouse mutants using purified fusion 
proteins. 

A: Expression of the fusion protein consisting of the site-specific DNA 
recombinase (e.g. Cre) and the protein transduction domain (e.g. the HIV 
derived TAT peptide) in prokaryotic or eukaryotic cells. 
B: Extraction and purification of the expressed fusion protein (e.g. as 
described in Nagahara et al., Nat. Med. 4 (12):1449-52 (1998)). 
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C: Injection of the purified fusion protein into mice carrying the RRS- 
flanked target sequence. 

D: Analysis of the pattern of induced target gene recombination and the 
resulting phenotype. 
Triangle: RRS. 

Fig. 2: Scheme of the bacterial expression vector pT7-TACS (SEQ ID 
NO:16). The coding region of the 11 amino acid protein transduction 
domain of HIV TAT protein is fused to the N-terminus of the Cre 
recombinase protein sequence. The 10-amino-acid strep tag and the 
protease factor Xa recognition sequence are fused to the C-terminus. The 
T7 promoter permits expression of TAT-Cre protein in E. coli. 

Fig. 3: Detection of purified TAT-Cre protein by Coomassie staining and 
Western blot analysis. 

A: Coomassie stained SDS-PAGE gel. Lane 1: 10 kDa ladder (Life 
Technologies, Cat. No.: 10064-012), 2: 1000 ng BSA, 3: 750 ng BSA, 4: 
500 ng BSA, 5: 100 ng BSA, 6: 50 ng BSA, 7: 5 pi TAT-Cre, 8: 1 pi TAT- 
Cre in Bicine buffer. 

B: Western blot analysis using an alkaline phosphatase-conjugated anti- 
strep tag antibody (IBA, Cat. No: 2-1503-001). Lane 1: MultiMark 
(Invitrogen, Cat. No.: LC5725), 2: 7 pi TAT-Cre, 3: 5 pi TAT-Cre, 4: 2,5 pi 
TAT-Cre, 5: 1,25 pi TAT-Cre in Bicine buffer. 

Fig. 4: X-Gal staining of M5Pax8 cells treated with TAT-Cre protein. 
M5Pax8 fibroblasts where treated for 18 h with 3,5 (A), 6,9 (B) and 13,8 
pg/ml TAT-Cre protein (C) in serum-free medium. Four days after 
treatment, cells were fixed and stained with X-Gal. 

Fio. 5: Measurement of 6-galactosidase activity in cell lysates. M5Pax8 
fibroblasts where treated for 18 h with increasing concentrations of TAT- 
Cre, as indicated, or transiently transfected with either expression vectors 
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for Cre (pCMV-I-Cre-pA, see SEQ ID NO:29) or B-galactosidase (pCMV-I- 
B-pA, see SEQ ID NO:30). Four days after treatment, cells were lysed and 
the B-galactosidase activities were determined. 

Fig. 6: PCR detection of TAT-Cre mediated recombination in mice. 
A: PCR-analysis of genomic DNA from duodenum (lane 2), liver (3), 
kidney (4), spleen (5), muscle (6), lung (7), tail (8) and brain (9) of a 
plnl3 mouse treated three times with intraperitoneal injections of 75 ug 
TAT Cre protein at two-day-intervals. Deletion of the loxP-flanked DNA 
segment is indicated by the presence of the about 400 bp fragment. Lane 
1: 1-kb-ladder (Life Technologies). 

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA 
segment. Arrows indicate the positions of the primers. 
C: PCR-analysis of genomic DNA from spleen of a plnl3 mouse treated 
three times with intraperitoneal injections of 75 ug TAT Cre protein at two- 
day-intervals (lane 4). To confirm the presence of the BamH I restriction 
site, the PCR product was digested with BamH I which produces two 
diagnostic fragments of about 190 and about 210 bp (5). As a control, tail 
DNA from untreated mice carrying the loxP-flanked (lane 2) and the 
detected plnl3 allele (3) was subjected to PCR amplification. Lane 1: 100 
bp ladder (Life Technologies), lane 6: 1 kb ladder (Life Technologies). 

Fig. 7: Scheme of the bacterial expression vectors pT7-VPCS (SEQ ID 
NO:17) and pCRT7-AVPCS (SEQ ID NO:15). The coding region of the 301 
amino acid protein transduction domain of HSV VP22 protein (A) or the 
. truncated 143 amino acid AVP22 domain (B) is fused to the N-terminus of 
the Cre recombinase protein sequence. The 10-amino-acid strep tag and 
the protease factor Xa recognition sequence are fused to the C-terminus. 
The T7 promoter allows the expression of VP22-Cre and AVP22-Cre fusion 
proteins in E. cofi. The sequence in pCRT7-AVPCS encoding the 15 amino 
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acid N-terminal leader sequence is used for enhanced protein stability 
(Invitrogen). 

Fig- 8: Detection .of the purified VP22-Cre and AVP22-Cre fusion proteins 
by Coomassie staining and Western blot analysis. 
A: Detection of VP22-Cre protein in a Coomassie-stained SDS-PAGE gel. 
Lane 1: 10 kDa ladder, 2: 1000 ng BSA, 3: 500 ng BSA, 4: 100 ng BSA, 
5: inclusion body protein extract before chromatography, 6: unbound 
protein, 7: fraction 17, 8: fraction 18, 9: fraction 19, 10: fraction 20. The 
position of the 75 kDa VP22-Cre protein is indicated by the arrow head. 
B: Detection of VP22-Cre protein by Western blot analysis using an 
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.: 2- 
1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein 
extract before chromatography, 3: unbound protein, 4: fraction 10, 5: 
fraction 11, 5: fraction 16, 6: fraction 17, 7: fraction 18, 8: fraction 19, 9: 
fraction 19, 10: fraction 20. 

C: Detection of AVP22-Cre protein in a Coomassie-stained SDS-PAGE gel. 
Lane 1: 10 kDa ladder, 2: inclusion body protein extract before 
chromatography, 3: unbound protein, 4: fraction 1, 5: fractioTi 8, 6: 
fraction 9, 7: fraction 15, 8: 100 ng BSA, 9: 500 ng BSA, 10: 1000 ng 
BSA. The position of the 60 kDa AVP22-Cre protein is Indicated by the 
arrow head. 

D: Detection of AVP22-Cre protein by Western blot analysis using a 
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.: 2- 
1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein 
extract before chromatography, 3: unbound protein, 4: fraction 4, 5: 
fraction 8, 6: fraction 10, 7: fraction 12, 8: soluble protein extract before 
chromatography, 9: unbound protein, 10: fraction 7. 



F 'q- 9: X-Gal staining of M5Pax8 cells treated with VP22-Cre and AVP22- 
Cre fusion proteins. M5Pax8 fibroblasts where treated for 18 h with either 
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Bicine buffer (A), 0.5 ug/ml VP22-Cre (B) or 3.75 g/ml AVP22-Cre (C) in 
serum-free medium. Four days after treatment, cells were fixed and 
stained with X-Gal. 

Fig. 10: Measurement of 6-galactosidase activity in cell lysates. M5Pax8 
fibroblasts where treated for 18 h with VP22-Cre, AVP22-Cre or Bicine 
buffer alone, as indicated or transiently transfected with expression 
vectors for Cre (pCMV-I-Cre-pA, see SEQ ID NO: 29) or B-galactosidase 
(pCMV-I-B-pA, see SEQ ID NO:30). Four days after treatment, cells were 
lysed and the 6-galactosidase activities were determined. 

Fig. 11: PCR detection of Cre mediated recombination in cells treated with 
VP22-Cre and AVP22-Cre fusion proteins shown in SEQ ID NOs: 21 and 
14, respectively). 

A: PCR-analysis of genomic DNA Isolated from M5Pax8 fibroblasts. Cells 
were transiently transfected with a Cre expression vector (lane 2) or 
treated for 18 h with either buffer alone (lane 3), 7.5 ug/ml VP22-Cre (4, 
5) or 15 ug/ml AVP22-Cre (6, 7) in serum-free medium. Four days after 
treatment, genomic DNA was extracted and subjected to PCR 
amplification. Deletion of the loxP-flanked DNA segment is indicated by the 
presence of the 226 bp DNA fragment. To confirm the presence of the Nco 
I restriction site in the recombined allele, the PCR products were digested 
with Nco I which produces two diagnostic fragments of 85bp and 141bp 
(lanes 5 and 7). Lane 1: 100 bp ladder (Life Technologies), lane 8: 1 kb 
ladder (Life Technologies). 

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA 
segment. Arrows indicate the positions of the primers. 

Detailed Description of the Invention 

The expression "target sequences" according to the present invention 
means all kind of sequences which may be mutated (viz. deleted, 
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translocated, integrated and/or inverted) by the action of the 
recombinase. The number of RRS in the target sequence depends on the 
kind of mutation to be performed by the recombinase. For most of the 
mutations (especially for deletions and invertions) two RRS are required 
which are flanking the sequence to be mutated (deleted or inverted). For 
some kinds of integrations only one RRS may be necessary within the 
target sequence. 

The "living organisms" according to the present invention are multi-cell 
organisms and can be vertebrates such as mammals (e.g., rodents such 
as mice or rats) or non-mammals (e.g., fish) or can be invertebrates such 
as insects or worms, or can be plants (higher plants, algi or fungi). Most 
preferred living organisms are mice and fish. 

"Cell culture" according to the present invention include cells isolated from 
the above defined living organism and cultured in vitro. These cells can be 
transformed (immortalized) or untransformed (directly derived from the 
living organism; primary cell culture). 

The site-specific DNA recombinase domain within the fusion protein of the 
invention of the present application is preferably selected from a 
recombinase protein derived from Cre, Flp, <|>C31 recombinase (Thorpe and 
Smith, Proc. Natl. Acad. Sci, USA, vol. 95, 5505-5510 (1998)), y5 
resolvase (Schwickardi and Droge, FEBS letters 471:147-150 (2000) and 
R recombinase (Araki et al., J. Mol. Biol., 182, 191-203 (1985)). The 
preferred recombinases are Cre and mutants thereof (preferably the Cre 
variant of aa 15 to 357 of SEQ ID NO: 2 or aa 325-667 of SEQ ID NO: 6) 
and Flp and variants thereof including Flpe (preferably the Flp variant of 
aa 15 to 437 of SEQ ID NO: 4 or aa 325 to 747 of SEQ ID NO: 8). 
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The protein transduction domain according to the present invention 

includes, but is not limited to, the PTDs mentioned in Background of the 

Invention. The PTD preferably is derived from the VP22 protein of HSV or 

from the TAT protein of HIV. Suitable TAT proteins include, but are not 

limited to, proteins comprising (i) the amino acid sequence shown in SEQ 

ID NO: 10 and mutant thereof such as 

(ii) proteins comprising the amino acid 

AGRKKRRQRRR (SEQ ID NO: 22) 

YARKARRQARR (SEQ ID NO: 23) 

YARAAARQARA (SEQ ID NO: 24) 

YARAARRAARR (SEQ ID NO:25) 

YARAARRAARA (SEQ ID NO: 26) 

YARRRRRRRRR (SEQ ID NO: 27) 

YAAARRRRRRR (SEQ ID NO:28) 

as known from WO 99/29721. Preferred are transduction domains 
consisting of the TAT proteins (i) and (ii) above. 

Suitable VP22 proteins include, but are not limited to, the wild-type VP22 
protein, i.e., a protein comprising amino acids 1 to 302 of SEQ ID No: 21, 
and truncated forms thereof. Truncated VP22 proteins in accordance with 
the present invention can be those lacking 1 to 158 amino acid residues at 
their N-terminal end. The most preferred VP22 protein is the truncated 
VP22 PTD comprising amino acid residues 16 to 157 of SEQ ID NO: 14. 

The fusion of the two domains of the fusion protein can occur at any 
possible position, i.e., the protein transduction domain can be fused to the 
N- or C-terminal of the site-specific DNA recombinase or can be fused to 
active sites within the site-specific DNA recombinase. Preferably the 
protein transfusion domain is fused to the N-terminal of the site-specific 
DNA recombinase domain. 
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The protein transduction domain can be fused to the site-specific DNA 
recombinase either through a direct chemical bond or through a linker 
molecule. Such linker molecule can be any bivalent chemical structure 
capable of linking the two domains. The preferred linker molecule 
according to the present invention is a short peptide, e.g., having 1 to 20, 
preferably 1 to 10, amino acid residues. Specifically preferred short 
peptides are essentially consisting of Gly, Ala and/or Leu. 

The fusion protein of the invention of the present application may further 
comprise other functional sequences such as secretion conferring signals, 
nuclear localisation signals and/or signals conferring protein stabilisation. 

In case the fusion protein comprises a protein transduction domain 
derived from the TAT protein of HIV, the DNA sequence coding for said 
fusion protein preferably comprises the sequence 
5' TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC 3'. 

Such a preferred DNA sequence is for instance shown in SEQ ID NO: 11. 
In said sequence the 3' terminal codon ggc codes for the linker Gly. The 
DNA sequence of a suitable recombinase may be directly attached to said 
codon ggc. 

The fusion protein can be obtained by the following steps: 

1. Fusion of the recombinase coding region (e.g. encoding Cre: see amino 
acids 15 to 357 of SEQ ID NO: 2) with the sequence conferring protein 
translocation (e.g. the sequence encoding the TAT peptide 
YGRKKRRQRRR, SEQ ID NO: 10) using standard cloning protocols 
(Maniatis et al., Cold Spring Harbor Laboratory, New York (1989)) or 
chemical synthesis. 



WO 01/49832 



PCT/EP01/00060 



14 

2. Generation of a construct for the expression of the fusion protein in 
prokaryotic or eukaryotic cells, e.g. in E. coli DH5a (Hanahan, J. Mol. 
Biol.; 166(4) :557-80 (1983)) using the QIAexpress pQE vector (Qiagen, 
Hilden). 

3. Expression of the above mentioned fusion protein in prokaryotic or 
eukaryotic cells, e.g. in E. coli DH5a (Hanahan, 1983) 

4. Extraction and purification of the above mentioned fusion protein e.g. 
as described in Nagahara etal., Nat. Med., 4(12):1449-52 (1998). 

In an experiment it was shown that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
both in cell lines and living organisms. In this experiment a vector for the 
expression of a TAT-Cre fusion protein in E. coli was constructed, TAT-Cre 
protein was expressed in E. coli and purified from bacterial lysates. To test 
the activity of the TAT-Cre protein in vitro, a reporter cell line that 
contains a loxP-containing reporter construct was used. This reporter, 
when recombined by Cre recombinase, allows the expression of a B- 
galacosidase gene. Further, a transgenic mouse strain carrying a loxP- 
flanked target was used to Invest the activity of the TAT-Cre protein in " 
vivo. 

In a second experiment it was shown that VP22-mediated delivery of 
active Cre protein works with sufficient efficacy to facilitate inducible gene 
targeting. In this experiment Bacterial expression vectors were 
constructed for the production of VP22-Cre fusion proteins in E. coli. The 
activity of purified VP22-Cre proteins were tested using a reporter 
fibroblast cell line containing a loxP-flanked reporter construct. 

Thus, the injection of the purified fusion protein of the present invention 
into a living organism (e.g., a mouse) carrying a gene comprising the 
RRS-flanked target sequence (e.g., in an amount of 1 to 200, preferably 5 
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to 50 ug per g body weight). To demonstrate the feasibility of the 
invention, a reporter mouse strain carrying an RRS-fianked cassette was 
used (Thorey etal., Mol. Cell Biol., 18(10):6164 (1998)). 

Analysis is achieved by determining the pattern of induced target gene 
recombination (e.g. through PCR analysis, Southern blot analysis or X-Gal 
staining on tissue sections; Maniatis et al., 1989; Gossler and Zachgo, 
Joyner AL (Ed.), Oxford University Press, Oxford, New York (1993)). 

The procedure's advantages over current technology are as follows: 

/' '3 

(i) The absence of background recombination before administration of. 
the fusion protein. 

(ii) The reduction of time and resources which are necessary to combine 
the recombinase transgene and two copies of the RRS-flanked target 
gene by conventional breeding. 

In experiments it was shown the following: (a) With a suitable vector for 
the expression of a TAT-Cre fusion protein, a TAT-Cre fusion protein was 
expressed in E. coli and purified from bacterial lysates. 

(b) A reporter cell line containing a loxP-containing reporter construct was 
used to test the activity of the TAT-Cre protein in vitro. This reporter, 
when recombined by Cre recombinase, allows the expression of a 6- 
galacosidase gene. 

(c) A transgenic mouse strain carrying a loxP-flanked target was used to 
invest the activity of the TAT-Cre protein in vivo. 

These experiments demonstrate that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
both in cell lines and living organisms. 
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Furthermore, bacterial expression vectors were constructed for the 
production of VP22-Cre fusion proteins in E. coii. The activity of purified 
VP22-Cre proteins were tested using a reporter fibroblast cell line 
containing a loxP-flanked reporter construct. These experiments 
demonstrate that VP22-mediated delivery of active Cre protein works with 
sufficient efficacy to facilitate inducible gene targeting. 

The invention is further illustrated by the following, non-limitative 
examples. 

Examples 

Materials and Methods 

Construction of pT7-TACS: The TAT-Cre coding region was generated by 
PCR using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers 
TATcre sense (5'-atg cca tgg get acg gec gca aga age gec gec aac gec gec 
i- ! gcg gca tgt cca att tac tga ccg tac acc-3'; SEQ ID NO: 31) and TATcre 
antisense (5'-ttt egg ate cgc cgc ata.acc agt g-3'; SEQ ID NO:32) and 10 
ng pCMV-I-Cre-pA (see SEQ ID NO: 29) as template. The PCR reaction was 
performed using the following cycle profile: 2' 94 °C, 4 x (30" 94 °C min, 
30" 50 °C, 1' 72 °C), 12 x (30" 94 °C min, 30" 55 °C, 1' 72 °C) and 10' 
72 °C. The resulting PCR fragment was digested with Nco I and BamH I, 
treated with Klenow enzyme and ligated into the plasmid pBSII KS+ which 
had been opened with restriction enzyme BamH I, treated with Klenow 
and dephosphorylated with calf intestinal phosphatase. The resulting 
plasmid pBS TAT- 5 'ere was verified by DNA sequencing. The Plasmid 
pCMV-I-Cre-pA (SEQ ID NO: 29) was digested with Age I and Sal I which 
released a 1,036 kb fragment containing the 3' part of the Cre coding 
region. This fragment was ligated into the plasmid pBS TAT-5'cre which 
had been opened with Age I and Sal I. 



WO 01/49832 



PCT/EP01/00060 



17 

10 ng pBS-TATCre was subjected to PCR amplification using 20 pmol of 
primers FPA001 (5'-tat ate tag acc atg ggc tac ggc cgc aag aag c-3'; SEQ 
ID NO: 33) and FPA002 (5'-gct acc acg acc ttc gat acc ate gec ate ttc cag 
cag gcg c-3'; SEQ ID NO:34). PCR was performed using 2,5 U Platinum 
Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer Solution (Gibco BRL) 
according to the manufacturers protocol. The following cycle profile was 
used: 2' 94 °C, 25 x (30" 94 °C min, 15" 54,6 °C, 2'30" 68 °C). The 
amplified PCR fragment was purified using GFX columns (Amersham 
Pharmacia), digested with Xba I and ligated into the plasmid pASK57 
(Skerra and Arne, Gene 151: 131-135 (1994)) which had been opened 
with restriction enzymes Xba I and Eco 47 III and dephosphorylated with 
calf intestinal phosphatase. The resulting plasmid pASK75-TACS was 
digested with restriction enzymes Nco I and Hind III which released a 1,1 
kb fragment. The fragment was subsequently ligated into the plasmid pT7- 
7 (Studier and Moffatt, 3. Mol. Biol. 189: 113-130 (1986)) which had been 
opened with restriction enzymes Nco I and Hind III and dephosphorylated 
with calf intestinal phosphatase resulting In the plasmid pT7-TACS (SEQ ID 
NO:16). 

Construction of pT7-VPCS: The Cre coding region was generated by PCR 
using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers VP22cre 
sense (5'-taa eta gcg gee gca tgt cca att tac tga ccg tac ac-3'; SEQ ID 
NO: 35) and VP22cre antisense (5'-tcg age ggc cgc cat cgc cat ctt cca gca 
ggc g-3'; SEQ ID NO:36) and 10 ng pgkcre-pA (SEQ ID NO:40) as 
template. The PCR reaction was performed using the following cycle 
profile: 2' 94 °C, 5 x (30" 94 °C, 30" 50 °C, 2' 72 °C), 15 x (30" 94 °C, 
30" 55 °C, 2' 72 °C) and 10' 72 °C. The resulting PCR fragment was 
digested with Not I and ligated into the plasmid pVP22/Myc-His 
(Invitrogen), which had been opened with restriction enzyme NotI, 
dephosphorylated with calf intestinal phosphatase. The resulting plasmid 
pVP22-cre myc/His was verified by DNA sequencing. 
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10 ng pVP22-cre myc/His was subjected to PCR amplification using 20 
pmol of primers FPA004 (5'-tat ate tag aca tat gac etc teg ccg etc cg-3'; 
SEQ ID NO:37) and FPA002 (SEQ ID NO:34). PCR was performed using 
2,5 U Platinum Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer 
Solution (Gibco BRL) according to the manufacturers protocol. The 
following cycle profile was used: 2' 94 °C, 25 x (30" 94 °C min, 15" 54,6 
°C, 2'30" 68 °C). The amplified PCR fragment was purified using GFX 
columns (Amersham Pharmacia), digested with Xba I and ligated into the 
plasmid pASK57 (Skerra and Arne, Gene 151: 131-135 (1994)) which had 
been opened with restriction enzymes Xba I and Eco 47 III and 
dephosphorylated with calf intestinal phosphatase. The resulting plasmid 
pASK75-VPCS was digested with restriction enzymes Nde I and Hind III 
which released a 2,0 kb fragment. The fragment was subsequently ligated 
into the plasmid pT7-7 (Studier and Moffatt, 3. Mol. Biol. 189: 113-130 
(1986)) which had been opened with restriction enzymes Nde I and Hind 
III and dephosphorylated with calf intestinal phosphatase resulting in the 
plasmid pT7-VPCS (SEQ ID NO: 17). 

Construction of PCRT7-AVPCS: The AVP22-Cre coding region was 
generated by PCR using Platinum Pfx DNA polymerase (Life Technologies), 
20 pmol of the primers FPA007 (5'-ttc cga aga cga cga aac acc-3'; SEQ ID 
NO: 38) and FPA008 (5'-tat att cga age tta tta acc acc gaa ctg cg-3'; SEQ 
ID NO:39) and 30 ng pT7-VPCS (SEQ ID NO: 17) as template. The PCR 
reaction was performed using the following cycle profile: 2' 94 °C, 25 x 
(30" 94 °C, 30" 61 °C, 2'30" 68 °C) and T 68 °C. The resulting 1,8 kb 
PCR fragment was digested with Nco I and Sfu I and ligated into the 
plasmid pCRT7/VP22-l (Invitrogen), which had been opened with 
restriction enzymes Nco I and Sfu I, and dephosphorylated with calf 
intestinal phosphatase. The resulting plasmid pCRT7-AVPCS (SEQ ID 
NO: 15) was verified by DNA sequencing. 
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Expression of the fusion proteins in E. coli: E. coli BL21(DE3)-RIL cells 
(Stratagene) were transformed with pT7-TACS and grown on LB agar 
plates containing 100 ug/ml ampicillin. E. coli BL21(DE3)-RP cells 
(Stratagene) were transformed with pT7-VPCS and grown on LB agar 
plates containing 100 ug/ml ampicillin. E. coli BL21(DE3)-pLysS 
(Inyitrogen) were transformed with pCRT7-AVPCS and grown on LB agar 
plates containing 25 ug/ml kanamycine and 34 ug/ml chloramphenicol. 
Single colonies were isolated and used to prepare glycerol stocks. Eight 
5ml LB (Lura Bertani) aliquots containing antibiotics were inoculated with 
stabs from the glycerol stocks and grown overnight at 37°C with shaking. 
Two 5ml overnight cultures were each used to inoculate one of four 1L LB 
aliquots containing antibiotics and grown at 37°C with shaking. Growth 
rate was monitored by spectrophotometry at 578nm. When the cultures 
had obtained an OD578 = 0,5 expression of the fusion proteins were 
induced by the addition of 0,5 mM Isopropyl-B-D-l-thiogalactopyranosid 
(IPTG). Two hours after induction cells were harvested by centrifugation at 
12000xg and the pellet rapidly frozen in liquid nitrogen and stored 
immediately at -80°C. 

Purification of the fusion proteins from bacterial lysates: Each lOg cell 
pellet was resuspended on ice in 30ml Bicine buffer (50mM Bicine, pH 8,5) 
including one protease inhibitor tablet (Complete, Roche). Cells were lysed 
through threefold treatment (1500psi, 5 minutes) with the cell disruption 
bomb (Parr Instrument). 30ml of Benzonase (10000U, Merck) was added 
and cell extracts were incubated for 30 minutes at 4°C. Cell extracts were 
then centrifuged at 12,000xg (4°C). The pellet was redissolved in 8M 
urea, 50mM Bicine, lOOmM DTT, pH 8,5 by incubation for 16 hours at 
4°C. Protein extract was centrifuged at 31000xg and supernatant 
harvested. Protein extract was diluted in an equal volume of 
Chromatography buffer A (50mM Bicine, pH 8,5). PH was adjusted to pH 
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8,5 and the extract was filtered through a 0,45um filter (Millipore). FPLC 
(Akta Explorer, Amersham Pharmacia) was performed using a cation 
exchange column (Sepharose SP, Column body HR_5/5 (0.5 x 5cm), 
column volume (CV) 1ml, linear flow 300cm/hour, Amersham Pharmacia). 
After addition of sample to FPLC column, buffer was exchanged with 
Chromatography buffer A at 10 CV. 

TAT-Cre and VP22-Cre fusion proteins were eluted from the column by 
gradient elution using chromatography buffer B (50mM Bicine, 1M NaCI, 
pH 8,5) using the following profile: 0 - 50 % buffer B, 0 CV; 50 % buffer 
B, 10 CV; 50 - 100 % buffer B (linear gradient), 20 CV; 100 % buffer B, 
10 CV. AVP22-Cre protein was eluted from the column by gradient elution 
using the following profile: 0-10 % buffer B, 0 CV; 10 % buffer B, 10 CV; 
10 - 30 % buffer B, 0 CV; 30 % buffer B, 10 CV; 30 - 100 % buffer B, 0 
CV; 100 % buffer B, 10 CV. Three 1,5ml fractions each containing purified 
fusion proteins were collected. Purity and concentration of protein 
fractions were determined by Coomassie blue stained SDS-PAGE gels and 
Western blot analysis using dilutions of BSA standard solutions. In addition 
protein content was determined using a Bradford assay (Coomassie Plus 
protein assay, Pierce). 

SDS-PAGE and Western blot analysis: SDS-PAGE and Coomassie staining 
was performed according to standard protocols (Maniatis et al., Cold 
Spring Harbor Laboratory, New York (1989)) using 4 - 12 % gradient 
SDS-polyacrylamide gels (NuPAGE, Invitrogen, cat. no.: NP0321). 
Western blot analysis was performed using a Semi-Try Blotting Chamber 
(Biorad) and nitrocellulose membranes (0,2 urn; Schleicher & Schuell) 
according to the manufacturers protocols. The fusion proteins were 
detected by using an alkaline phosphatase-conjugated anti-strep tag 
antibody (IBA, Cat. No.: 2-1503-001) according to the manufacturers 
protocol. 
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Generation of the M5P ax8 Cre reporter cell line: The SV40-transformed 
murine embryonic fibroblast line MEF5/5 (Schwenk et al., Nucl Acids Res 
26(6), 1427-32 (1998)) was transfected with the vector pPGKpaXl 
(Kellendonk et al> Nucl. Acids Res. 24, 1404-11 (1996)). 10 6 MEF5/5 cells 
were electroporated with 20 ug pPGKpaXl plasmid DNA linearised with 
Sea I and plated into 48-well-plates. The cells were cultured in 
DMEM/Glutamax medium (Life Technologies) supplemented with 10 % 
fetal calf serum at 37°C, 10 % C0 2 in humid atmosphere. Two days after 
transfection the medium was supplemented with 5 ug/ml puromycine 
(Calbiochem) for the selection of stable integrants. 14 puromycine- 
resistant clones were expanded and tested by transien transfection with 
the Cre expression vector pPGK-Cre-pA (SEQ ID NO: 40). In two out of 
the 14 puromycine-resistant clones, the expression of B-galactosidase 
could be detected by staining with X-Gal. One of these clones, M5Pax8, 
was used as Cre reporter cell line. 

Transfection and mea surement of B-aalactosidase activity: Fibroblasts 
(10 6 cells per 24 well plate (Falcon)) were transfected with 25 ng pCMV-I- 
Cre-pA (see SEQ ID NO: 29) or pCNIV-I-B-pA (see SEQ ID NO: 30) plasmids 
using the FuGene transfection reagent (Roche Diagnostics). After 2 days 
the cells were lysed and the 6-galactosidase activities were determined 
with the 6-galactosidase reporter gene assay (Roche. Diagnostics) 
according to the manufacturers guidelines using a Lumistar luminometer 
(MWG). 

Histochemical detection of 6-aalactosiri ase activity: To quantitate 6- 
galactosidase expression, fibroblast cells were washed once with 
phosphate buffered saline (PBS), and the cells were fixed for 5 minutes at 
room temperature in a solution of 4% formaldehyde in PBS. Next, the cells 
were washed twice with PBS and finally incubated in staining solution for 
24 hours at 37°C (staining solution: 5 mM K3(Fe(CN)6), 5mM 
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K4(Fe(CN)6), 2mM MgCI2, Img/ml X-Gal (BioMol) in PBS). Blue stained, 
B-galactosidase positive cells were detected and distinguished from 
negative (transparent) cells in a cell culture binocular microscope under 
200x magnification. For each determination a minimum of 200 cells was 
counted. 

PCR detecti on of Cre- mediated recombination: Genomic DNA extracted 
from tissue samples was subjected to PCR using Taq-polymerase (Gibco 
BRL Cat. No. 10342-020) using 20 pmol of each primer (sense: 5^-CAT 
CTC CGG GCC TTT CGA CCT G - 3', antlsense: 5' -GCG ATC GGT GCG 
_ GGC CTC TTC - 3'; SEQ ID Nos: 41 and 42, respectively). PCR was 

performed using the following cycle profile: 2' 94°C, 35 x (30" 94°C, 30" 
55 °C, V 72 °C), 10 min 72 °C. PCR products were separated on a 1,2 % 
agarose gel. 

Example 1 

The vector pT7-TACS (SEQ ID NO: 16) was constructed for the expression 
of a TAT-Cre fusion protein in E. coli. The plasmid contains the coding 
region of the 11 amino acid protein transduction domain of the wild-type 
HIV TAT protein (Green and Loewenstein, Cell, 55(6): 1179-88 (1988); 
Frankel and Pabo, Cell, 55(6): 1189-93 (1988); SEQ ID NO:10) fused to 
the N-terminus of Cre recombinase protein sequence. The 10-amino-acid 
strep tag at the C-terminus allows the detection and purification of the 
fusion protein using specific antibodies (Schmidt and Skerra, 3. 
Chromatogr A 676: 337-345 (1994)). The protease factor Xa recognition 
site (Ile-Glu-Gly-Arg) permits the removal of the strep tag by proteolytic 
cleavage. The estimated molecular weight of the TAT-Cre fusion protein is 
42 kDa. A scheme of the TAT-Cre expression vector is depicted in figure 2. 
For the expression of TAT-Cre, the E. coli strain BL21(DE3)-RIL 
(Stratagene) was used. This strain carries an IPTG-inducible T7 
polymerase gene and additional copies of the tRNA genes for the' Yare 
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codons' argU, ileY and leuW. 

E. coli BL21(DE3)-RIL cells were transformed with pT7-TACS and grown in 
LB medium containing 100 ug/ml ampicillin. The expression of the 40 kDa 
TAT-Cre fusion protein could be strongly induced by the addition of 0,5 
mM IPTG to the culture medium. Analysis of protein lysates revealed that 
approximately 50 % of TAT-Cre protein accumulated as insoluble inclusion 
bodies. The inclusion bodies where extracted and dissolved in 8 M urea. 
TAT-Cre was subsequently purified from this fraction using ion exchange 
chromatography. The quantity and purity of TAT-Cre protein was 
determined using Coomassie stained SDS-PAGE gels and Western blot 
analysis (figure 3). The purification process yielded TAT-Cre protein 
extracts of 64 % purity and a concentration of 100 ug/ml. 
To analyse the ability of the purified TAT-Cre protein to transduce into 
cultured cells, we used the fibroblast cell line M5Pax8 (R. Kuhn, 
unpublished) that contains a loxP-containing reporter construct. This 
reporter, when recombined by Cre recombinase, allows the expression of 
a 13-galacosidase gene (Buchholz et al, Nucleic Acids Res. 24, 4256-4262, 
1996). Cells were cultured for 18 h with increasing concentrations of TAT- 
Cre protein in serum-free medium and analysed 4 days later for 6- 
Galacosidase activity. Staining with X-Gal showed that > 50 % of the cells 
treated with 13,8 ug/ml TAT-Cre protein expressed B-galactosidase 
indicating recombination of the loxP-flanked reporter construct had 
occurred (figure 4). Measurement of 6-galactosidase activity in cell 
lysates revealed an up to 30-fold higher level of B-galactosidase activity 
in comparison to cells which had been transiently transfected with an 
eukaryotic Cre expression vector (figure 5). 

To investigate the activity of TAT-Cre protein in a living organism, we used 
a transgenic mouse strain carrying a loxP-flanked target for Cre-mediated 
recombination (Thorey etal., 1998, Mol. Cell. Biol. 18: 3081 - 3088). Mice 
where treated three times with intraperitoneal injections of 75 ug TAT Cre 
protein at two-day-intervals and analysed 2 days later. Genomic DNA was 
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isolated from a variety of organs and subjected to PCR amplification which ' 
specifically amplifies a 400 bp fragment of the recombined allele. The 
deleted allele could be detected in multiple tissues from treated mice 
indicating TAT-Cre-mediated recombination in these organs (figure 6). 
This experiments demonstrates that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
In cell lines and in living organisms. 

Example 2 

The vectors pT7-VPCS (SEQ ID NO: 17) and pCRT7-AVPCS (SEQ ID NO: 15) 
were constructed for the expression of VP22-Cre and AVP22-Cre fusion 
proteins in E. coli. The VP22-Cre gene of pT7-VPCS contains the full length 
protein translocation domain of the HSV VP22 protein (ENiott and O'Hare, 
Cell, 88(2): 223-33 (1987), whereas the AVP22-Cre gene of pCRT7-AVPCS 
contains a truncated VP22 protein transduction domain (amino acids 159 - 
301; Invitrogen; aa 16-157 of SEQ ID N0:14) fused to the N-terminus of 
Cre recombinase protein sequence. A 10-amino-acid strep tag at the C- 
terminus of Cre protein sequence allows the detection and purification of 
the fusion proteins using specific antibodies (Schmidt and Skerra, J. 
Chromatogr A 676: 337-345 (1994)). The protease factor Xa recognition 
site permits the removal of the Strep tag by proteolytic cleavage. The 
estimated molecular weight is 75 kDa for VP22-Cre protein and 60 kDa for 
AVP22-Cre protein. A scheme of the vectors pT7-VPCS and pCRT7-AVPCS 
is depicted In figure 7. 

E. coli BL21(DE3)-RIP cells (Stratagene) were transformed with pT7-VPCS 
and cultured in LB medium containing 100 pg/ml ampicillin. E. coli 
BL21(DE3)-pLysS cells (Stratagene) were transformed with pCRT7-AVPCS 
and cultured in LB medium containing 25 pg/ml kanamycine and 34 pg/ml 
chloramphenicol. Expression of the VP22-Cre and AVP22-Cre fusion 
proteins could be induced by the addition of 0,5 mM IPTG to the culture 
medium. Analysis of protein extracts using Coomassie staining and 
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Western blotting of SDS-PAGE gels revealed that 50 - 60 % of VP22-Cre 
and AVP22-Cre proteins accumulated as insoluble inclusion bodies. The 
inclusion bodies where extracted and dissolved in 8 M urea. VP22-Cre and 
AVP22-Cre fusion proteins were subsequently purified using ion exchange 
chromatography. The quantity and purity of the isolated VP22-Cre and A 
VP22-Cre fusion proteins was determined using Coomassie stained SDS- 
PAGE gels and Western blot analysis (figure 8). 
To analyse the ability of the purified fusion proteins to transduce into 
cultured cells, we used the fibroblast cell line M5Pax8 that contains a loxP- 
containing reporter construct. When recombined by Cre recombinase, the 
reporter allows the expression of a 6-galacosidase gene (Buchholz et al, 
Nucleic Acids Res. 24, 4256-4262, 1996). The cells where cultured for 18 
h with increasing concentrations of VP22-Cre and AVP22-Cre in serum-free 
medium and analysed 4 days later for B-Galacosidase activity. Staining 
with X-Gal showed ~2 % blue cells in the cultures treated with up to 15 
ug/ml AVP22-Cre indicating recombination of the loxP-flanked reporter 
construct had occurred. In contrast, cell cultures treated with up to 0,5 
ug/ml VP22-Cre did not show any X-gal staining (figure 9). Measurement 
of cell lysates revealed a strong increase of 6-galactosidase activity upon A 
VP22-Cre treatment when compared to untreated cells (figure 10). 
Genomic DNA was isolated fand subjected to PCR amplification that 
specifically amplifies a 250 bp fragment of the recombined allele. The 
deleted allele could be detected in cells treated with both VP22-Cre and A 
VP22-Cre fusion proteins (figure 11). 

This experiment demonstrates that VP22-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting. 
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SEQUENCE LISTING 
<110> ARTEMIS Pharmaceuticals GmbH 

<120> Transduction of recombinases for inducible gene 
targeting 

<130> 010007wo/JH/ml 

<140> 
<141> 

<160> 42 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1074 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<220> 

<221> CDS 

<222> (1) . . (1071) 

<400> 1 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tec . 48 
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
15 10 15 

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96 
Asn Leu Leu Thr Val His Gin Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 14 4 
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tec gtt tgc egg 192 
Gin Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

teg tgg gcg gca tgg tgc aag ttg aat aac egg aaa tgg ttt ccc gca 240 
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

gaa cct gaa gat gtt cgc gat tat ctt eta tat ctt cag gcg cgc ggt 288 
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gin Ala Arg Gly 
85 90 95 

ctg gca gta aaa act ate cag caa cat ttg ggc cag eta aac atg ctt 336 
Leu Ala Val Lys Thr lie Gin Gin His Leu Gly Gin Leu Asn Met Leu 
100 105 ~ 110 

cat cgt egg tec ggg ctg cca cga cca agt gac age aat get gtt tea* 38 4 
His Arg Arg Ser . Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 
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ctg gtt atg egg egg ate cga aaa gaa aac gtt gat gee ggt gaa cgt 432 
Leu Val Met Arg Arg lie Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

gca aaa cag get eta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480 
Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gin Val Arg 
145 150 155 160 

tea etc atg gaa aat age gat cgc tgc cag gat ata cgt aat ctg gca 528 
Ser Leu Met Glu Asn Ser Asp Arg Cys Gin Asp He Arg Asn Leu Ala 
165 170 175 

ttt ctg ggg att get tat aac ace ctg tta cgt ata gee gaa att gee 576 
Phe Leu Gly He Ala Tyr Asn Thr Leu Leu Arg He Ala Glu He Ala 
180 185 190 

agg ate agg gtt aaa gat ate tea cgt act gac ggt ggg aga atg tta 624 
Arg He Arg Val Lys Asp He Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

ate cat att ggc aga acg aaa acg ctg gtt age acc gca ggt gta gag 672 
He His He Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

aag gca ctt age ctg ggg gta act aaa ctg gtc gag cga tgg att tec 720 
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp lie Ser 
225 230 235 240 

gtc tct ggt gta get gat gat ccg aat aac tac ctg ttt tgc egg gtc 768 
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

aga aaa aat ggt gtt gee gcg cca tct gee acc age cag eta tea act 816 
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gin Leu Ser Thr 
260 265 270 

cgc gee ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864 
Arg Ala Leu Glu Gly He Phe Glu Ala Thr His Arg Leu He Tyr Gly 
275 280 285 

get aag gat gac tct ggt cag aga tac ctg gee tgg tct gga cac agt 912 
Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

gee cgt gtc gga gee gcg cga gat atg gee cgc get gga gtt tea ata 960 
Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser He 
305 310 315 320 

ccg gag ate atg caa get ggt ggc tgg acc aat gta aat att gtc atg " 1008 
Pro Glu He Met Gin Ala Gly Gly Trp Thr Asn Val Asn He Val Met 
325 330 335 

aac tat ate cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056 
Asn Tyr He Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 ~ 350 

ctg gaa gat ggc gat tag 1074 
Leu Glu Asp Gly Asp 
355 



<210> 2 
<211> 357 
<212> PRT 
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<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<400> 2 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
15 10 15 

Asn Leu Leu Thr Val His Gin Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

Gin Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gin Ala Arg Gly 
85 90 95 

Leu Ala Val Lys Thr lie Gin Gin His Leu Gly Gin Leu Asn Met Leu . ' " 
100 105 110 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

Leu Val Met Arg Arg lie Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gin Val Arg 
145 150 155 160 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gin Asp He Arg Asn Leu Ala 
165 170 175 

Phe Leu Gly He Ala Tyr Asn Thr Leu Leu Arg He Ala Glu He Ala 
180 185 190 

Arg He Arg Val Lys Asp He Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

He His He Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp He Ser 
225 230 235 240 

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gin Leu Ser Thr 
260 265 270 

Arg Ala Leu Glu Gly He Phe Glu Ala Thr His Arg Leu He Tyr Gly 
275 280 285 

Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser He 
305 310 315 320 
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Pro Glu lie Met Gin Ala Gly Gly Trp Thr Asn Val Asn lie Val Met 
325 330 335 

Asn Tyr He Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

Leu Glu Asp Gly Asp 
355 



<210> 3 
<211> 1317 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<220> 

<221> CDS 

<222> (1) . . (1311) 

<400> 3 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg agt 4 8 
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
15 10 15 

caa ttt gat ata tta tgt aaa aca cca cct aag gtc ctg gtt cgt cag 96 
Gin Phe Asp He Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gin 
20 25 30 



ttt gtg gaa agg ttt gaa aga cct tea ggg gaa aaa ata gca tea tgt 

Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys He Ala Ser Cys 

35 40 45 

get get gaa eta acc tat tta tgt tgg atg att act cat aac gga aca 

Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met He Thr His Asn Gly Thr 

50 55 60 



144 



192 



gca ate aag aga gec aca ttc atg age tat aat act ate ata age aat 24 0 

Ala He Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He He Ser Asn 

65 70 75 80 

teg ctg agt ttc gat att gtc aac aaa tea etc cag ttt aaa tac aag 288 

Ser Leu Ser Phe Asp He Val Asn Lys Ser Leu Gin Phe Lys Tyr Lys 

85 90 " 95 

acg caa aaa gca aca att ctg gaa gec tea tta aag aaa tta att cct 336 

Thr Gin Lys Ala Thr He Leu Glu Ala Ser Leu Lys Lys Leu He Pro 

100 105 ' 110 

get tgg gaa ttt aca att att cct tac aat gga caa aaa cat caa tct 38 4 

Ala Trp Glu Phe Thr He He Pro Tyr Asn Gly Gin Lys His Gin Ser 

115 120 125 

gat ate act gat att gta agt agt ttg caa tta cag ttc gaa tea teg 432 

Asp He. Thr Asp He Val Ser Ser Leu Gin Leu Gin Phe Glu Ser Ser 

130 135 140 

gaa gaa, gca gat aag gga aat age cac agt aaa aaa atg ctt aaa gca 480 

Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala 

!45 150 155 160 
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ctt eta agt gag ggt gaa age ate tgg gag ate act gag aaa ata eta 528 
Leu Leu Ser Glu Gly Glu Ser lie Trp Glu He Thr Glu Lys He Leu 
165 170 175 

aat teg ttt gag tat ace teg aga ttt aca aaa aca aaa act tta tac 57 6 
Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr 
180 ! 185 ~* 190 

caa ttc etc ttc eta get act ttc ate aat tgt gga aga ttc age gat 624 
Gin Phe Leu Phe Leu Ala Thr Phe He Asn Cys Gly Arg Phe Ser Asp 
195 200 "* 205 

att aag aac gtt gat ccg aaa tea ttt aaa tta gtc caa aat aag tat 672 
He Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn Lys Tyr 
210 215 220 

ctg gga gta ata ate cag tgt tta gtg aca gag aca aag aca age gtt 720 
Leu Gly Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr Ser Val 
225 230 235 240 

agt agg cac ata tac ttc ttt age gca agg ggt agg ate gat cca ctt 768 
Ser Arg His He Tyr Phe Phe Ser Ala Arg Gly Arg He Asp Pro Leu 
245 250 " • 255 

gta tat ttg gat gaa ttt ttg agg aat tct gaa cca gtc eta aaa cga 816 
Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg 
260 265 270 

gta aat agg acc ggc aat tct tea age aac aaa cag gaa tac caa tta 864 
Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr Gin Leu 
275 280 ** 285 

tta aaa gat aac tta gtc aga teg tac aac aag get ttg aag aaa aat 912 
Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn 
290 295 300 

gcg cct tat cca ate ttt get ata aag aat ggc cca aaa tct cac att 960 
Ala Pro Tyr Pro He Phe Ala He Lys Asn Gly Pro Lys Ser His He 
305 310 315 320 

gga aga cat ttg atg acc tea ttt ctg tea atg aag ggc eta acg gag 1008 
Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu 
325 330 t 335 

ttg act aat gtt gtg gga aat tgg age gat aag cgt get tct gec gtg 105 6 
Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val 
340 345 350 

gec agg aca acg tat act cat cag ata aca gca ata cct gat cac tac 1104 
Ala Arg Thr Thr Tyr Thr His Gin He Thr Ala lie Pro Asp His Tyr 
355 360 365 

ttc gca eta gtt tct egg tac tat gca tat gat cca ata tea aag gaa 1152 
Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro lie Ser Lys Glu 
370 375 "* 380 

atg ata gca ttg aag gat gag act aat cca att gag gag tgg cag cat 1200 
Met He Ala Leu Lys Asp Glu Thr Asn Pro lie Glu Glu Trp Gin His 
385 390 395 400 

ata gaa cag eta aag ggt agt get gaa gga age ata cga tac ccc gca 1248 
He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser lie Arg Tyr Pro Ala 
405 410 ' 415 
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tgg aat ggg ata ata tea cag gag gta eta gac tac ctt tea tec tac 1296 
Trp Asn Gly lie lie Ser Gin Glu Val Leu Asp Tyr Leu Ser Ser Tyr 
420 425 ~ 430 

ata aat aga cgc ata taatga 1317 
lie Asn Arg Arg lie 
435 : 



<210> 4 
<211> 437 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<400> 4 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
1 5 10 15 

Gin Phe Asp lie Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gin 
20 25 ~ 30 

Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys He Ala Ser Cys 
35 40 " " 45 

Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met He Thr His Asn Gly Thr 
50 55 60 

Ala He Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He He Ser Asn 
65 70 75 80 

Ser Leu Ser Phe Asp He Val Asn Lys Ser Leu Gin Phe Lys Tyr Lys 
85 90 95 

Thr Gin Lys Ala Thr He Leu Glu Ala Ser Leu Lys Lys Leu He Pro 
100 105 HO 

Ala Trp Glu Phe Thr He He Pro Tyr Asn Gly Gin Lys His Gin Ser 
115 120 125 

Asp He Thr Asp He Val Ser Ser Leu Gin Leu Gin Phe Glu Ser Ser 
130 135 140 

Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala 
145 150 155 160 

Leu Leu Ser Glu Gly Glu Ser He Trp Glu He Thr Glu Lys lie Leu 
165 • 170 175 

Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr 
180 185 190 

Gin Phe Leu Phe Leu Ala Thr Phe He Asn Cys Gly Arg Phe Ser Asp 
195 200 205 

He Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn Lys Tyr 
210 215 220 

Leu Gly. Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr Ser Val 
225 230 235 240 
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Ser Arg His He Tyr Phe Phe Ser Ala Arg Gly Arg He Asp Pro Leu 
245 250 " " 255 

Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg 
260 265 270 

Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr Gin Leu 
275 ■ 280 285 

Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn 
290 295 300 

Ala Pro Tyr Pro He Phe Ala He Lys Asn Gly Pro Lys Ser His He 
305- 310 315 320 

Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu 
325 330 * "* 335 

Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val 
340 345 350 

Ala Arg Thr Thr Tyr Thr His Gin He Thr Ala He Pro Asp His Tyr 
355 360 365 

Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro lie Ser Lys Glu 
370 375 380 

Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Trp Gin His 
385 390 395 400 

He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser He Arg Tyr Pro Ala 
405 410 415 

Trp Asn Gly He He Ser Gin Glu Val Leu Asp Tyr Leu Ser Ser Tyr 
420 425 430 

lie Asn Arg Arg lie 
435 

<210> 5 
<211> 2004 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 

<220> 

<221> CDS 

<222> (1) . . (2001) 

<400> 5 

atg acc tct cgc cgc tec gtg aag teg ggt ccg egg gag gtt ccg cgc 48 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 -.15 



gat gag tac gag gat 
Asp Glu Tyr Glu Asp 
20 



ctg tac tac acc ccg tct tea ggt atg gcg agt 
Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
25 30 



96 
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Pro III Q 9t u° g n Ct ? a ° aCC tCC Cgc cgt ggc 9 CC cta ca S aca cgc 14 4 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 y 

teg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag teg gat 192 
Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser III 
30 55 60 

tat gcc etc tac ggg ggc teg tct tec gaa gac gac gaa cac ccg gag 240 
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

fff p^ C £ 9g f£ g ° gg cgt ccc gtt tcc 9gg gcg gtt ttg tec ggc ccg 288 
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

ggg cct gcg egg gcg cct ccg cca ccc get ggg tcc gga ggg gec gga 336 
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 

ioo 105 no 

cgc aca ccc acc ace gee ccc egg gec ccc cga acc cag egg gtg gcg 
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val All 
115 120 125 

act aag gee ccc gcg gec ccg gcg gcg gag acc acc cgc ggc agg aaa 
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr A?g Gly Aig Lys 
130 135 140 

ttl li* r? g n° a 2t a o CC g f° gca ctc cca gac gcc ccc 9°9 tc <3 acg 480 
Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 

145 150 155 160 

111 ^ mK C « ga o CC f ag a u a CCC gcg cag ggg ct 9 9 CC a 9 a aa ? ctg 528 
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 

165 170 175 

S a ° £u fc agC aCC gcc ccc cca aac ccc 9 ac 9°g cca tgg acc ccc egg 57 6 
Hxs Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

3 tg 9 ?° S? 0 ttfc aac aag cgc 9 tc ttc fc 9 c gcc gcg gtc ggg cgc ctg 624 
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly A?g Leu 
195 200 205 

ti g f, tg Cat gCC Cgg atg gcg 9 C 9 9tc cag ctc tgg gac atg teg 672 
Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Tr? Lp Me? Ser 
210 215 



384 



432 



220 



a™ ^ T C gaC Ct ° aaC gaa ctc ctt ^ c atc *cc acc 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
zZo 230 — ~ 



720 



235 240 



ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag coc acc aac 7fiS 
He Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ma Asn 
245 250 255 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816 
Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

Thr Are Ilv a™ tit ?? 9 ?? 9 tCg CgC CCC acC gag cga cct c 9 a 9 CC 864 
Tnr Arg Gly Arg Ser Ala Ala- Ser Arg Pro Thr Glu Arg Pro Arg Ala 

275 280 285 
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cca gcc cgc tec get tct cgc ccc aga egg ccc gtc gag ggt ace gag 912 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 ' 300 

etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 960 
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Gin His 
305 310 315 320 

agt ggc gge cgc atg tec aat tta ctg ace gta cac caa aat ttg cct 1008 
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
325 330 335 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056 
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104 
Asp Met Phe Arg Asp Arg Gin Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

ctt ctg tec gtt tgc egg teg tgg gcg gca tgg tgc aag ttg aat aac 1152 
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

egg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt eta 1200 
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

tat ctt cag gcg cgc ggt ctg gca gta aaa act ate cag caa cat ttg 124 8 
Tyr Leu Gin Ala Arg Gly Leu Ala Val Lys Thr lie Gin Gin His Leu 
405 410 415 

ggc cag eta aac atg ctt cat cgt egg tec ggg ctg cca cga cca agt 1296 
Gly Gin Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 ~ 430 

gac age aat get gtt tea ctg gtt atg egg egg ate cga aaa gaa aac 1344 
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg He Arg Lys Glu Asn 
435 440 445 

gtt gat gcc ggt gaa cgt gca aaa cag get eta gcg ttc gaa .cgc act 1392 
Val Asp Ala Gly Glu Arg Ala Lys Gin Ala -Leu Ala Phe Glu Arg Thr 
450 455- 460 

gat ttc gac cag gtt cgt tea etc atg gaa aat age gat cgc tgc cag 1440 
Asp Phe Asp Gin Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gin 
465 470 475 * 480 

gat ata cgt aat ctg gca ttt ctg ggg att get tat aac acc ctg tta 1488 
Asp He Arg Asn Leu Ala Phe Leu Gly He Ala Tyr Asn Thr Leu Leu 
485 490 ^ 495 

cgt ata gcc gaa att gcc agg ate agg gtt aaa gat ate tea cgt act 1536 
Arg He Ala Glu He Ala Arg He Arg Val Lys Asp lie Ser Arg Thr 
500 505 "* 510 

gac ggt ggg aga atg tta ate cat att ggc aga acg- aaa acg ctg gtt 1584 
Asp Gly Gly Arg Met Leu He His He Gly Arg Thr Lys Thr Leu Val 
515 520 ~ 525 

age acc gca ggt gta gag aag gca ctt age ctg ggg gta act aaa ctg 1632 
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 
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gtc gag cga tgg att tec gtc tct ggt gta get gat gat ccg aat aac 1680 
Val Glu Arg Trp lie Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 560 

tac ctg ttt tgc egg gtc aga aaa aat ggt gtt gec gcg cca tct gec 1728 
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
565 570 575 

acc age cag.cta tea act cgc gee ctg gaa ggg att ttt gaa gca act 1776 
Thr Ser Gin Leu Ser Thr Arg Ala Leu Glu Gly He Phe Glu Ala Thr 
580 585 590 

cat cga ttg att tac ggc get aag gat gac tct ggt cag aga tac ctg 1824 
His Arg Leu He Tyr Gly Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu 
595 600 605 

gee tgg tct gga cac agt gec cgt gtc gga gec gcg cga gat atg gec 1872 
. Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

cgc get gga gtt tea ata ccg gag ate atg caa get ggt ggc tgg acc 1920 
Arg Ala Gly Val Ser He Pro Glu He Met Gin Ala Gly Gly Trp Thr 
625 630 635 640 

aat gta aat att gtc atg aac tat ate cgt aac ctg gat agt gaa aca 1968 
Asn Val Asn He Val Met Asn Tyr He Arg Asn Leu Asp Ser Glu Thr 
645 650 ^ 655 

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat tag 2004 
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp 
660 665 

<210> 6 
<211> 667 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 

<400> 6 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 HO 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 
U5 120 " 125 
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Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lvs 
130 135 140 

Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
210 215 220 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly He Thr Thr 
225 230 235 240 

He Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp He Gin His 
305 310 315 ' 320 

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
325 330 335 

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

Asp Met Phe Arg Asp Arg Gin Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

Tyr Leu Gin Ala Arg Gly Leu Ala Val Lys Thr He Gin Gin His Leu 
405 410 415 

Gly Gin Leu Asn Met Leii His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 ~ 430 

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg He Arg Lys Glu Asn 
435 440. " 445 

Val Asp Ala Gly Glu Arg Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr 
450 455 460 
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Asp Phe Asp Gin Val 
4 65 

Asp He Arg Asn Leu 
485 

Arg He Ala Glu He 

500 ! 

Asp Gly Gly Arg Met 
515 

Ser Thr Ala Gly Val 
530 

Val Glu Arg Trp He 
545 

Tyr Leu Phe Cys Arg 
565 

Thr Ser Gin Leu Ser 
580 

His Arg Leu He Tyr 
595 

Ala Trp Ser Gly His 
610 

Arg Ala Gly Val Ser 
625 

Asn Val Asn He Val 
645 

Gly Ala Met Val Arg 
660 



Arg Ser Leu Met Glu Asn Ser 
470 475 

Ala Phe Leu Gly He Ala Tyr 
490 

Ala Arg He Arg Val Lys Asp 
505 

Leu He His He Gly Arg Thr 
520 

Glu Lys Ala Leu Ser Leu Gly 
535 540 

Ser Val Ser Gly Val Ala Asp 
550 555 

Val Arg Lys Asn Gly Val Ala 
570 

Thr Arg Ala Leu Glu Gly He 
585 

Gly Ala Lys Asp Asp Ser Gly 
600 

Ser Ala Arg Val Gly Ala Ala 
615 620 

He Pro Glu He Met Gin Ala 
630 635 

Met Asn Tyr He Arg Asn Leu 
650 

Leu Leu Glu Asp Gly Asp 
665 



Asp Arg Cys Gin 
480 

Asn Thr Leu Leu 
495 

He Ser Arg Thr 
510 

Lys Thr Leu Val 
525 

Val Thr Lys Leu 



Asp Pro Asn Asn 
560 

Ala Pro Ser Ala 
575 

Phe Glu Ala Thr 
590 

Gin Arg Tyr Leu 
605 

Arg Asp Met Ala 



Gly Gly Trp Thr 
.640 

Asp Ser Glu Thr 
655 



<210> 7 
<211> 2247 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<220> 

<221> CDS 

<222> (1) . . (2241) 

<400> 7 

atg acc tct cgc cgc tec gtg aag teg ggt ccg egg gag gtt ccg cgc 4 8 
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

gat gag tac gag gat ctg tac tac acc ccg tct tea ggt atg gcg agt 96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 
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ccc gat agt ccg cct gac acc tec cgc cgt ggc gec eta cag aca cgc 14 4 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

teg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag teg gat 192 
Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 55 " 60 

tat gee etc tac ggg ggc teg tct tec gaa gac gac gaa cac ccg. gag 24 0 
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

gtc ccc egg acg egg cgt ccc gtt tec ggg gcg gtt ttg tec ggc ccg 288 
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

ggg cct gcg egg gcg cct ccg cca ccc get ggg tec gga ggg gee gga 336 
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 ^ 110 

cgc aca ccc acc acc gee ccc egg gee ccc cga acc cag egg gtg gcg 384 
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 
115 120 125 

act aag gee ccc gcg gee ccg gcg gcg gag acc acc cgc ggc agg aaa 432 
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

teg gee cag cca gaa tec gee gca etc cca gac gee ccc gcg teg acg 480 
Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

gcg cca acc cga tec aag aca ccc gcg cag ggg ctg gee aga aag ctg 528 
Ala. Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

cac ttt age acc gee ccc cca aac ccc gac gcg cca tgg acc ccc egg 576 
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

gtg gee ggc ttt aac aag cgc gtc ttc tgc gee gcg gtc ggg cgc ctg 624 
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

gcg gee atg cat gee egg atg gcg gcg gtc cag etc tgg gac atg teg 672 
Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
210 215 220 

cgt ccg cgc aca gac gaa gac etc aac gaa etc ctt ggc ate acc acc 720 
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
225 230 235 240 

ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gec aac 7 68 
He Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 



gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gee acg gcg 816 
Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 * 270 

act cga ggg cgt tct gcg gcg teg cgc ccc acc gag cga cct cga gee 8 64 
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
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cca gcc cgc tec get tct cgc ccc aga egg ccc gtc gag ggt acc gag 912 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 960 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Gin His 

305 310 315 320 

agt ggc ggc cgc atg agt caa ttt gat ata tta tgt aaa aca cca cct 1008 
Ser Gly Gly Arg Met Ser Gin Phe Asp lie- Leu Cys Lys Thr Pro Pro 
325 330 " 335 

aag gtc ctg gtt cgt cag ttt gtg gaa agg ttt gaa aga cct tea ggg 1056 

Lys Val Leu Val Arg Gin Phe Val Glu Arg Phe Glu Arg Pro Ser Gly 
340 345 " 350 

gaa aaa ata gca tea tgt get get gaa eta acc tat tta tgt tgg atg 1104 

Glu Lys lie Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met 
355 360 365 

att act cat aac gga aca gca ate aag aga gcc aca ttc atg age tat 1152 

lie Thr His Asn Gly Thr Ala lie Lys Arg Ala Thr Phe Met Ser Tyr 
370 375 380 

aat act ate ata age aat teg ctg agt ttc gat att gtc aac aaa tea 1200 

Asn Thr He He Ser Asn Ser Leu Ser Phe Asp He Val Asn Lys Ser 

385 390 395 400 

etc cag ttt aaa tac aag acg caa aaa gca aca att ctg gaa gcc tea . 1248 

Leu Gin Phe Lys Tyr Lys Thr Gin Lys Ala Thr He Leu Glu Ala Ser 
405 410 415 

tta aag aaa tta att cct get tgg gaa ttt aca att att cct tac aat 1296 

Leu Lys Lys Leu He Pro Ala Trp Glu Phe Thr He He Pro Tyr Asn 
420 425 430 

gga caa aaa cat caa tct gat ate act gat att gta agt agt ttg caa 1344* 

Gly Gin Lys His Gin Ser Asp He Thr Asp lie Val Ser Ser Leu Gin 
435 440 445 

tta cag ttc gaa tea teg gaa gaa gca gat aag gga aat age cac agt 1392 

Leu Gin Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser 
450 455 460 

aaa aaa atg ctt aaa gca ctt eta agt gag ggt gaa age ate tgg gag 1440 

Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser He Trp Glu 

465 470 475 480 

ate act gag aaa ata eta aat teg ttt gag tat acc teg aga ttt aca 1488 

He Thr Glu Lys lie Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr 
485 490 495 

aaa aca aaa act tta tac caa ttc -etc ttc eta get act ttc ate aat 1536 

Lys Thr Lys Thr Leu Tyr Gin Phe Leu Phe Leu Ala Thr Phe He Asn 
500 505 510 

tgt gga aga ttc age gat att aag aac gtt gat ccg aaa tea ttt aaa 1584 

Cys Gly Arg Phe Ser Asp lie Lys Asn Val Asp Pro Lys Ser Phe Lys 
515 520 525 

tta gtc caa aat aag tat ctg gga gta ata ate cag tgt tta gtg aca 1632 

Leu Val Gin Asn Lys Tyr Leu Gly Val lie lie Gin Cys Leu Val Thr 
530 535 540 
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gag aca aag aca age gtt agt agg cac ata tac ttc ttt age gca agg 1680 

GJu Thr Lys Thr Ser Val Ser Arg His lie Tyr Phe Phe Ser Ala Arg 

545 550 555 560 

ggt agg ate gat cca ctt gta tat ttg gat gaa ttt ttg agg aat tct 1728 
Gly Arg lie Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser 
565 570 575 

gaa cca gtc eta aaa cga gta aat agg acc ggc aat tct tea age aac 1776 
Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn 
580 585 590 

aaa cag gaa tac caa tta tta aaa gat aac tta gtc aga teg tac aac 1824 
Lys Gin Glu Tyr Gin Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn 
595 ~ 600 605 

aag get ttg aag aaa aat gcg cct tat cca ate ttt get ata aag aat 1872 
Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro lie Phe Ala He Lys Asn 
610 615 620 

ggc cca aaa tct cac att gga aga cat ttg atg acc tea ttt ctg tea 1920 
Gly Pro Lys Ser His He Gly Arg His Leu Met Thr Ser Phe Leu Ser 
625 630 635 640 

atg aag ggc eta acg gag ttg act aat gtt gtg gga aat tgg age gat 1968 
Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp 
645 650 655 

aag cgt get tct gee gtg gee agg aca acg tat act cat cag ata aca 2016 
Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gin He Thr 
660 665 670 

gca ata cct gat cac tac ttc gca eta gtt tct egg tac tat gca tat 2064 
Ala He Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr 
675 680 685 

gat cca ata tea aag gaa atg ata gca ttg aag gat gag act aat cca 2112 
Asp Pro He Ser Lys Glu Met He Ala Leu Lys Asp Glu Thr Asn Pro 
690 " 695 700 

att gag gag tgg cag cat ata gaa cag eta aag ggt agt get gaa gga 2160 
He Glu Glu Trp Gin His He Glu Gin Leu Lys Gly Ser Ala Glu Gly 
705 710 715 720 

age ata cga tac ccc gca tgg aat ggg ata ata tea cag gag gta eta 2208 
Ser He Arg Tyr Pro Ala Trp Asn Gly He He Ser Gin Glu Val Leu 
725 730 735 

gac tac ctt tea tec tac ata aat aga cgc ata taatga 2247 
Asp Tyr Leu Ser Ser Tyr He Asn Arg Arg lie 
740 745 



<210> 8 
<211> 747 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<400> 8 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
15 10 15 
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Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 " 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 • 105 110 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 
115 120 125 

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
210 215 220 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
225 230 235 240 

lie Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

Thr Arg' Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Gin His 
305 310 315 320 

Ser Gly Gly Arg Met Ser Gin Phe Asp lie Leu Cys Lys Thr Pro Pro 
325 330 335 



Lys Val Leu Val Arg Gin Phe Val Glu Arg Phe Glu Arg Pro Ser Gly 
340 345 350 
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Glu Lys lie Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met 
355 ~ 360 365 

lie Thr His Asn Gly Thr Ala lie Lys Arg Ala Thr Phe Met Ser Tyr 
370 375 380 

Asn Thr He He Ser Asn Ser Leu Ser Phe Asp He Val Asn Lys Ser 
385 390 395 400 

Leu Gin Phe Lys Tyr Lys Thr Gin Lys Ala Thr He Leu Glu Ala Ser 
405 * 410 415 

Leu Lys Lys Leu He Pro Ala Trp Glu Phe Thr He He Pro Tyr Asn 
420 425 430 

Gly Gin Lys His Gin Ser Asp He Thr Asp He Val Ser Ser Leu Gin 
435 . 440 445 

Leu Gin Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser 
450 455 460 

Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser He Trp Glu 
465 470 475 480 

He Thr Glu Lys He Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr 
485 490 495 

Lys Thr Lys Thr Leu Tyr Gin Phe Leu Phe Leu Ala Thr Phe He Asn 
500 505 510 

Cys Gly Arg Phe Ser Asp He Lys Asn Val Asp Pro Lys Ser Phe Lys 
515 520 525 

Leu Val Gin Asn Lys Tyr Leu Gly Val He He Gin Cys Leu Val Thr 
530 535 540 

Glu Thr Lys Thr Ser Val Ser Arg His He Tyr Phe Phe Ser Ala Arg 
545 " 550 555 560 

Gly Arg He Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser 
565 570 575 

Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn 
580 585 590 

Lys Gin Glu Tyr Gin Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn 
595 " 600 605 

Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro He Phe Ala He Lys Asn 
610 615 620 

Gly Pro Lys Ser His He Gly Arg His Leu Met Thr Ser Phe Leu Ser 
625 630 635 640 

Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp 
645 650 655 

Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gin He Thr 
660 665 670 



Ala He Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr 
675 680 685 
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Asp Pro lie Ser Lys Glu Met lie Ala Leu Lys Asp Glu Thr Asn Pro 
690 '695 700 

He Glu Glu Trp Gin His He Glu Gin Leu Lys Gly Ser Ala Glu Gly 
705 710 715 720 

Ser He Arg Tyr Pro Ala Trp Asn Gly He He Ser Gin Glu Val Leu 
725 730 735 

Asp Tyr Leu Ser Ser Tyr He Asn Arg Arg He 
740 745 



<210> 9 
<211> 33 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 9 

tacggccgca agaagcgccg ccaacgccgc cgc 33 

<210> 10 
<211> 11 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 10 

Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg 
15 10 



<210> 


11 


<211> 


42 


<212> 


DNA 


<213> 


Human 


<220> 




<221> 


CDS 


<222> 


(4) 


<400> 


11 



(42) 



atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc 42 
Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly 
15 10 



<210> 12 
<211> 13 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 12 

Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly 
1 5 10 



<210> 13 
<211> 1623 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein deltaVP22cre-StrepTag 

<220> 

<221> CDS 

<222> (1) . . (1617) 

<400> 13 

atg get age atg act ggt gga cag caa atg ggt egg gat ccg teg acg 4 8 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Asp Pro Ser Thr 
1 5 10 15 

gcg cca acc cga tec aag aca ccc gcg cag ggg ctg gec aga aag ctg 96 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
20 25 30 

cac ttt age acc gec ccc cca aac ccc gac gcg cca tgg acc ccc egg 14 4 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
35 40 45 

gtg gec ggc ttt aac- aag cgc gtc ttc tgc gec gcg gtc ggg cgc ctg 192 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
50 55 60 

gcg gec atg cat gee egg atg gcg get gtc cag etc tgg gac atg teg 240 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met ' Ser 
65 70 75 * f 80 

cgt ccg cgc aca gac gaa gac etc aac gaa etc ctt ggc ate acc acc 288 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
85 90 95 

ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gee aac 336 

lie Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
100 105 110 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gec acg gcg 38 4 

Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
115 120 125 

act cga ggg cgt tct gcg gcg teg cgc ccc acc gag cga cct cga gec 432 

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
130 135 140 

cca gec cgc tec get tct cgc ccc aga egg ccc gtc gag ggt acc gag 480 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
145 150 155 160 

etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 528 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp He Gin His 

165 170 175 

agt ggc ggc cgc atg tec aat, tta ctg acc gta cac caa aat ttg cct 576 

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
180 185 190 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 624 

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
195 200 205 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 67 2. 

Asp Met Phe Arg Asp Arg Gin Ala Phe Ser Glu His Thr Trp Lys Met 
210 215 220 
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ctt ctg tec gtt tgc egg teg tgg gcg gca tgg tgc aag ttg aat aac 720 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 

225 230 235 240 

egg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt eta 768 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
245 250 255 

tat ctt cag gcg cgc ggt ctg gca gta aaa act ate cag caa cat ttg 816 

Tyr Leu Gin Ala Arg Gly Leu Ala Val Lys Thr He Gin Gin His Leu 
260 265 270 

ggc cag eta aac atg ctt cat cgt egg tec ggg ctg cca cga cca agt 864 

Gly Gin Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
275 280 " 285 

gac age aat get gtt tea ctg gtt atg egg egg ate cga aaa gaa aac 912 

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg He Arg Lys Glu Asn 

290 295 300 

gtt gat gee ggt gaa cgt gca aaa cag get eta gcg ttc gaa cgc act 960 

Val Asp Ala Gly Glu Arg Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr 

305 310 315 320 

gat ttc gac cag gtt cgt tea etc atg gaa aat age gat cgc tgc cag 1008 

Asp Phe Asp Gin Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gin 
325 330 335 

gat ata cgt aat ctg gca ttt ctg ggg att get tat aac ace ctg tta 1056 

Asp He Arg Asn Leu Ala Phe Leu Gly He Ala Tyr Asn Thr Leu Leu 
340 345 350 

cgt ata gee gaa att gec agg ate agg gtt aaa gat ate tea cgt act 1104 

Arg He Ala Glu He Ala Arg He Arg Val Lys Asp He Ser Arg Thr 
355 360 365 

gac ggt ggg aga atg tta ate cat att ggc aga acg aaa acg ctg gtt 1152 

Asp Gly Gly Arg Met Leu He His He Gly Arg Thr Lys Thr Leu Val 

370 375 380- 

age ace gca ggt gta gag aag gca ctt age ctg ggg gta act aaa ctg 1200 

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 

385 390 395 400 

gtc gag cga tgg att tec gtc tct ggt gta get gat gat ccg aat aac 1248 

Val Glu Arg Trp He Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
405 410 415 

tac ctg ttt tgc egg gtc aga aaa aat ggt gtt gec gcg cca tct gee 1296 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
420 425 430 

acc age cag eta tea act cgc gee ctg gaa ggg att ttt gaa gca act 1344 

Thr Ser Gin Leu Ser Thr Arg Ala Leu Glu Gly lie Phe Glu Ala Thr 
435 440 445 

cat cga ttg att tac ggc get aag gat gac tct ggt cag aga tac ctg 1392 

His Arg Leu He Tyr Gly Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu 

450 455 460 

gec tgg tct gga cac agt gec cgt gtc gga gee gcg cga gat atg gee 1440 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 

465 470 475 " 480 
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cgc get gga gtt tea ata ccg gag ate atg caa get ggt ggc tgg ace 1488 

Arg Ala Gly Val Ser He Pro Glu He Met Gin Ala Gly Gly Trp Thr 
485 490 495 

aat gta aat att gtc atg aac tat ate cgt aac ctg gat agt gaa aca 1536 

Asn Val Asn He Val Met Asn Tyr He Arg Asn Leu Asp Ser Glu Thr 
500 505 510 

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt ate gaa ggt cgt 1584 

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly He Glu Gly Arg 
515 520 525 

ggt age get tgg cgt cac ccg cag ttc ggt ggt taataa 1623 

Gly Ser Ala Trp Arg His Pro Gin Phe Gly Gly 

530 535 



<210> 14 
<211> 539 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein deltaVP22cre-StrepTag 

<400> 14 

Met Ala Ser Met Thr 
1 5 

Ala Pro Thr Arg Ser 
20 

His Phe Ser Thr Ala 
35 

Val Ala Gly Phe Asn 
50 

Ala Ala Met His Ala 
65 

Arg Pro Arg Thr Asp 
85 

He Arg Val Thr Val 
100 

Glu Leu Val Asn Pro 
115 

Thr Arg Gly Arg Ser 
130 

Pro Ala Arg Ser Ala 
145 

Leu Gly Ser Thr Ser 
165 

Ser Gly Gly Arg Met 
180 

Ala Leu Pro Val Asp 
195 



Gly Gly Gin Gin Met Gly Arg Asp Pro Ser Thr 
10 15 

Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
25 30 

Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
40 45 

Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
55 60 

Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
70 75 80 

Glu Asp Leu Asn Glu Leu Leu Gly He Thr Thr 
90 95 

Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
105 110 

Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
120 125 

Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
135 140 

Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
150 155 160 

Pro Val Trp Trp Asn Ser Ala Asp He Gin His 
170 175 

Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
185 190 



Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
200 205 
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Asp Met Phe Arg Asp Arg Gin 
210 " 215 



Ala Phe Ser Glu His 
220 



Thr Trp Lys Met 



Leu Leu Ser Val Cys Arg Ser 
225 230 

Arg Lys Trp Phe Pro Ala Glu 
245 

Tyr Leu Gin Ala Arg Gly Leu 
260 

Gly Gin Leu Asn Met Leu His 
275 



Trp Ala Ala Trp Cys 
235 

Pro Glu Asp Val Arg 
250 

Ala Val Lys Thr He 
265 

Arg Arg Ser Gly Leu 
280 



Lys Leu Asn Asn 
240 

Asp Tyr Leu Leu 
255 

Gin Gin His Leu 
270 

Pro Arg Pro Ser 
285 



Asp Ser Asn Ala Val Ser Leu 
290 295 



Val Met Arg Arg He 
300 



Arg Lys Glu Asn 



Val Asp Ala Gly Glu Arg Ala 
305 310 

Asp Phe Asp Gin Val Arg Ser 
325 

Asp He Arg Asn Leu Ala Phe 
340 

Arg He Ala Glu He Ala Arg 
355 

Asp Gly Gly Arg Met Leu He 
370 375 

Ser Thr Ala Gly Val Glu Lys 
385 390 



Lys Gin Ala Leu Ala 
315 

Leu Met Glu Asn Ser 
330 

Leu Gly He Ala Tyr 
345 

He Arg Val Lys Asp 
360 

His He Gly Arg Thr 
380 

Ala Leu Ser Leu Gly 
395 . 



Phe Glu Arg Thr 
320 

Asp Arg Cys Gin 
335 

Asn Thr Leu Leu 
350 

He Ser Arg Thr 
365 

Lys Thr Leu Val 



Val Thr Lys Leu 
400 



Val Glu Arg Trp He Ser Val 
405 

Tyr Leu Phe Cys Arg Val Arg 
420 

Thr Ser Gin Leu Ser Thr Arg 
435 

His Arg Leu He Tyr Gly Ala 
450 455 

Ala Trp Ser Gly His Ser Ala 
465 470 



Ser Gly Val Ala Asp 
410 

Lys Asn Gly Val Ala 
425 

Ala Leu Glu Gly He 
440 

Lys Asp Asp Ser Gly 
460 

Arg Val Gly Ala Ala 
475 



Asp Pro Asn Asn 
415 

Ala Pro Ser Ala 
430 

Phe Glu Ala Thr 
445 

Gin Arg Tyr Leu 



Arg Asp Met Ala 
480 



Arg Ala Gly Val Ser He Pro 
485 

Asn Val Asn He Val Met Asn 
500 

Gly Ala Met Val Arg Leu Leu 
515 

Gly Ser Ala Trp Arg His Pro 
530 535 



Glu He Met Gin Ala 
490 

Tyr He Arg Asn Leu 
505 

Glu Asp Gly Asp Gly 
520 

Gin Phe Gly Gly 



Gly Gly Trp Thr 
4 95 

Asp Ser Glu Thr 
510 

He Glu Gly Arg 
525 
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<210> 15 
<211> 5953 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCRT7-deltaVPCS 

<400> 15 

cgatggattt ccgtctctgg tgtagctgat gatccgaata actacctgtt ttgccgggtc 60 
agaaaaaatg gtgttgccgc gccatctgcc accagccagc tatcaactcg cgccctggaa 120 
gggatttttg aagcaactca tcgattgatt tacggcgcta aggatgactc tggtcagaga 180 
tacctggcct ggtctggaca cagtgcccgt gtcggagccg cgcgagatat ggcccgcgct 24 0 
ggagtttcaa taccggagat catgcaagct ggtggctgga ccaatgtaaa tattgtcatg 300 
aactatatcc gtaacctgga tagtgaaaca ggggcaatgg tgcgcctgct ggaagatggc 360 
gatggtatcg aaggtcgtgg tagcgcttgg cgtcacccgc agttcggtgg ttaataagct 420 
tcgaacaaaa actcatctca gaagaggatc tgaatatgca taccggtcat catcaccatc 480 
accattgagt tttgagcaat aactagcata accccttggg gcctctaaac gggtcttgag 540 
gggttttttg ctgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca 600 
tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa 660 
agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg 720 
ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc 780 
ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga 840 
tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact 900 
gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaattaattc 960 
ttagaaaaac tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat 1020 
accatatttt tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca 1080 
taggatggca agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc 114 0 
tattaatttc ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac 1200 
tgaatccggt gagaatggca aaagcttatg catttctttc cagacttgtt caacaggcca 1260 
gccattacgc tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg 1320 
cgcctgagcg agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga 1380 
atgcaaccgg cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata 1440 
ttcttctaat acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc 1500 
atcaggagta cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt 1560 
tagtctgacc atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa 1620 
caactctggc gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac 1680 
attatcgcga gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg 174 0 
cctcgagcaa gacgtttccc gttgaatatg gctcataaca ccccttgtat tactgtttat 1800 
gtaagcagac agttttattg ttcatgacca aaatccctta acgtgagttt tcgttccact 1860 
gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg 1920 
taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc 1980 
aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata 204 0 
ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta 2100 
catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc 2160 
ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg 2220 
ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac 2280 
agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg 2340 
taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt 2400 
atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct 2460 
cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg 2520 
ccttttgctg gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata 2580 
accgtattac cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca 2640 
gcgagtcagt gagcgaggaa gcggaagagc gcctgatgcg gtattttctc cttacgcatc 2700 
tgtgcggtat ttcacaccgc atatatggtg cactctcagt acaatctgct ctgatgccgc 2760 
atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc t^cgccccga 2820 
cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 2880 
agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 2940 
aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg 3000 
tctgcctgtt catccgcgtc cagctcgttg. agtttctcca gaagcgttaa tgtctggctt 3060 
ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga tgcctccgtg 3120 
taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag gatgctcacg 3180 
atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg taaacaactg 3240 
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gcggtatgga 
aatacagatg 
ataatggtgc 
accattcatg 
tcgcgtatcg 
ctcaacgaca 
cgccgcgtgc 
tgcgcattca 
ttagcgaggt 
gcaacgcggg 
atgtgctcgc 
ggctggtaag 
tggacagcat 
tggggaaggc 
ccgccatgcc 
cgaaggcttg 
tcgcgctcca 
ctacgagttg 
cccaccggaa 
tatgcgactc 
ccgcaaggaa 
ccaccatacc 
catcggtgat 
ccacgatgcg 
tatagggaga 
tatacatatg 
aacccgatcc 
cccaaacccc 
cgccgcggtc 
catgtcgcgt 
cgtgacggtc 
cgtggtgcag 
caccgagcga 
taccgagctc 
cggccgcatg 
aacgagtgat 
tgagcatacc 
gaataaccgg 
tcaggcgcgc 
tcatcgtcgg 
gcggatccga 
acgcactgat 
acgtaatctg 
caggatcagg 
cagaacgaaa 
taaactggtc 



tgcggcggga 
taggtgttcc 
agggcgctga 
ttgttgctca 
gtgattcatt 
ggagcacgat 
ggctgctgga 
cagttctccg 
gccgccggct 
gaggcagaca 
cgaggcggca 
agccgcgagc 
ggcctgcaac 
catccagcct 
ggcgataatg 
agcgagggcg 
gcgaaagcgg 
catgataaag 
ggagctgact 
ctgcattagg 
tggtgcatgc 
cacgccgaaa 
gtcggcgata 
tccggcgtag 
ccacaacggt 
gctagcatga 
aagacacccg 
gacgcgccat 
gggcgcctgg 
ccgcgcacag 
tgcgagggca 
gacgtcgacg 
cctcgagccc 
ggatccacta 
tccaatttac 
gaggttcgca 
tggaaaatgc 
aaatggtttc 
ggtctggcag 
tccgggctgc 
aaagaaaacg 
ttcgaccagg 
gcatttctgg 
gttaaagata 
acgctggtta 
gag 



ccagagaaaa 
acagggtagc 
cttccgcgtt 
ggtcgcagac 
ctgctaacca 
catgcgcacc 
gatggcggac 
caagaattga 
tccattcagg 
aggtataggg 
taaatcgccg 
gatccttgaa 
gcgggcatcc 
cgcgtcgcga 
gcctgcttct 
tgcaagattc 
tcctcgccga 
aagacagtca 
gggttgaagg 
aagcagccca 
aaggagatgg 
caagcgctca 
taggcgccag 
aggatcgaga 
ttccctctag 
ctggtggaca 
cgcaggggct 
ggaccccccg 
cggccatgca 
acgaagacct 
aaaacctgct 
cggccacggc 
cagcccgctc 
gtccagtgtg 
tgaccgtaca 
agaacctgat 
ttctgtccgt 
ccgcagaacc 
taaaaactat 
cacgaccaag 
ttgatgccgg 
ttcgttcact 
ggattgctta 
tctcacgtac 
gcaccgcagg 



atcactcagg 
cagcagcatc 
tccagacttt 
gttttgcagc 
gtaaggcaac 
cgtggccagg 
gcgatggata 
.ttggctccaa 
tcgaggtggc 
cggcgcctac 
tgacgatcag 
gctgtccctg 
cgatgccgcc 
acgccagcaa 
cgccgaaacg 
cgaataccgc 
aaatgaccca 
taagtgcggc 
ctctcaaggg 
gtagtaggtt 
cgcccaacag 
tgagcccgaa 
caaccgcacc 
tctcgatccc 
aaataatttt 
gcaaatgggt 
ggccagaaag 
ggtggccggc 
tgcccggatg 
caacgaactc 
tcagcgcgcc 
gactcgaggg 
cgcttctcgc 
gtggaattct 
ccaaaatttg 
ggacatgttc 
ttgccggtcg 
tgaagatgtt 
ccagcaacat 
tgacagcaat 
tgaacgtgca 
catggaaaat 
taacaccctg 
tgacggtggg 
tgtagagaag 



gtcaatgcca 
ctgcgatgca 
acgaaacacg 
agcagtcgct 
cccgccagcc 
acccaacgct 
tgttctgcca 
ttcttggagt 
ccggctccat 
aatccatgcc 
cggtccagtg 
atggtcgtca 
ggaagcgaga 
gacgtagccc 
tttggtggcg 
aagcgacagg 
gagcgctgcc 
gacgatagtc 
catcggtcga 
gaggccgttg 
tcccccggcc 
gtggcgagcc 
tgtggcgccg 
gcgaaattaa 
gtttaacttt 
cgggatccgt 
ctgcacttta 
tttaacaagc 
gcggctgtcc 
cttggcatca 
aacgagttgg 
cgttctgcgg 
cccagacggc 
gcagatatcc 
cctgcattac 
agggatcgcc 
tgggcggcat 
cgcgattatc 
ttgggccagc 
gctgtttcac 
aaacaggctc 
agcgatcgct 
ttacgtatag 
agaatgttaa 
gcacttagcc 



gcgcttcgtt 
gatccggaac 
gaaaccgaag 
tcacgttcgc 
tagccgggtc 
gcccgagatg 
agggttggtt 
ggtgaatccg 
gcaccgcgac 
aacccgttcc 
atcgaagtta 
tctacctgcc 
agaatcataa 
agcgcgtcgg 
ggaccagtga 
ccgatcatcg 
ggcacctgtc 
atgccccgcg 
cgctctccct 
agcaccgccg 
acggggcctg 
cgatcttccc 
gtgatgccgg 
tacgactcac 
aagaaggaga 
cgacggcgcc 
gcaccgcccc 
gcgtcttctg 
agctctggga 
ccaccatccg 
tgaatccaga 
cgtcgcgccc 
ccgtcgaggg 
agcacagtgg 
cggtcgatgc 
aggcgttttc 
ggtgcaagtt 
ttctatatct 
taaacatgct 
tggttatgcg 
tagcgttcga 
gccaggatat 
ccgaaattgc 
tccatattgg 
tgggggtaac 



3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
5953 



<210> 16 
<211> 4727 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-TACS 

<400> 16 

atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60 

ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 120 

tgttagcagc cggatctcag tggtggtggt ggtggtgctc gagtgcggcc gcaagcttat 180 

taaccaccga actgcgggtg acgccaagcg' ctaccacgac cttcgatacc atcgccatct 240 

tccagcaggc gcaccattgc ccctgtttca ctatccaggt tacggatata gttcatgaca 300 

atatttacat tggtccagcc accagcttgc atgatctccg gtattgaaac tccagcgcgg 360 
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gccatatctc 
ccagagtcat 
gcgcgagttg 
caaaacaggt 
agtttagtta 
gttctgccaa 
atcctggcaa 
ttacgtatat 
gtgcgttcga 
atccgccgca 
cgatgaagca 
gcctgaagat 
ttattcaact 
tgctcagaaa 
ctcgttgcat 
ccgcggcggc 
gttaaacaaa 
atttcgcggg 
gctcctgtcg 
atcaccgata 
aacaacatga 
gccctgcacc 
acctacatct 
ccgcatccat 
atcagtaacc 
cagaaatccc 
ggcccgcttt 
ggatgaacag 
ctgcctcgcg 
ggtcacagct 
gggtgttggc 
tactggctta 
gtgaaatacc 
cgctcactga 
aggcggtaat 
aaggccagca 
tccgcccccc 
caggactata 
cgaccctgcc 
ctcatagctc 
gtgtgcacga 
agtccaaccc 
gcagagcgag 
acactagaag 
gagttggtag 
gcaagcagca 
cggggtctga 
caaaaaggat 
gtatatatga 
cagcgatctg 
cgatacggga 
caccggctcc 
gtcctgcaac 
gtagttcgcc 
cacgctcgtc 
catgatcccc 
gaagtaagtt 
ctgtcatgcc 
gagaatagtg 
cgccacatag 
tctcaaggat 
gatcttcagc 
atgccgcaaa 
ttcaatatta 



gcgcggctcc 
ccttagcgcc 
atagctggct 
agttattcgg 
cccccaggct 
tatggattaa 
tttcggctat 
cctggcagcg 
acgctagagc 
taaccagtga 
tgtttagctg 
atagaagata 
tgcaccatgc 
acgcctggcg 
cgaccggtaa 
gttggcggcg 
attatttcta 
atcgagatct 
ttgaggaccc 
cgcgagcgaa 
atggtcttcg 
attatgttcc 
gtattaacga 
accgccagtt 
cgtatcgtga 
ccttacacgg 
atcagaagcc 
gcagacatct 
cgtttcggtg 
tgtctgtaag 
gggtgtcggg 
actatgcggc 
gcacagatgc 
ctcgctgcgc 
acggttatcc 
aaaggccagg 
tgacgagcat 
aagataccag 
gcttaccgga 
acgctgtagg 
accccccgtt 
ggtaagacac 
gtatgtaggc 
gacagtattt 
ctcttgatcc 
gattacgcgc 
cgctcagtgg 
cttcacctag 
gtaaacttgg 
tctatttcgt 
gggcttacca 
agatttatca 
tttatccgcc 
agttaatagt 
gtttggtatg 
catgttgtgc 
ggccgcagtg 
atccgtaaga 
tatgcggcga 
cagaacttta 
cttaccgctg 
atcttttact 
aaagggaata 
ttgaagcatt 



gacacgggca 
gtaaatcaat 
ggtggcagat 
atcatcagct 
aagtgccttc 
cattctccca 
acgtaacagg 
atcgctattt 
ctgttttgca 
aacagcattg 
gcccaaatgt 
atcgcgaaca 
cgcccacgac 
atccctgaac 
tgcaggcaaa 
cttcttgcgg 
gagggaaacc 
cgggcagcgt 
ggctaggctg 
cgtgaagcga 
gtttccgtgt 
ggatctgcat 
agcgctggca 
gtttaccctc 
gcatcctctc 
aggcatcagt 
agacattaac 
gtgaatcgct 
atgacggtga 
cggatgccgg 
gcgcagccat 
atcagagcag 
gtaaggagaa 
tcggtcgttc 
acagaatcag 
aaccgtaaaa 
cacaaaaatc 
gcgtttcccc 
tacctgtccg 
tatctcagtt 
cagcccgacc 
gacttatcgc 
ggtgctacag 
ggtatctgcg 
ggcaaacaaa 
agaaaaaaag 
aacgaaaact 
atccttttaa 
tctgacagtt 
tcatccatag 
tctggcccca 
gcaataaacc 
tccatccagt 
ttgcgcaacg 
gcttcattca 
aaaaaagcgg 
ttatcactca 
tgcttttctg 
ccgagttgct 
aaagtgctca 
ttgagatcca 
ttcaccagcg 
agggcgacac 
tatcagggtt 



ctgtgtccag 
cgatgagttg 
ggcgcggcaa 
acaccagaga 
tctacacctg 
ccgtcagtac 
gtgttataag 
tccatgagtg 
cgttcaccgg 
ctgtcacttg 
tgctggatag 
tcttcaggtt 
cggcaaacgg 
atgtccatca 
ttttggtgta 
ccgtagccca 
gttgtggtct 
tgggtcctgg 
gcggggttgc 
ctgctgctgc 
ttcgtaaagt 
cgcaggatgc 
ttgaccctga 
acaacgttcc 
tcgtttcatc 
gaccaaacag 
gcttctggag 
tcacgaccac 
aaacctctga 
gagcagacaa 
gacccagtca 
attgtactga 
aataccgcat 
ggctgcggcg 
gggataacgc 
aggccgcgtt 
gacgctcaag 
ctggaagctc 
cctttctccc 
cggtgtaggt 
gctgcgcctt 
cactggcagc 
agttcttgaa 
ctctgctgaa 
ccaccgctgg 
gatctcaaga 
cacgttaagg 
attaaaaatg 
accaatgctt 
ttgcctgact 
gtgctgcaat 
agccagccgg 
ctattaattg 
ttgttgccat 
gctccggttc 
ttagctcctt 
tggttatggc 
tgactggtga 
cttgcccggc 
tcattggaaa 
gttcgatgta 
tttctgggtg 
ggaaatgttg 
attgtctcat 



accaggccag 
cttcaaaaat 
caccattttt 
cggaaatcca 
cggtgctaac 
gtgagatatc 
caatccccag 
aacgaacctg 
catcaacgtt 
gtcgtggcag 
tttttactgt 
ctgcgggaaa 
acagaagcat 
ggttcttgcg 
cggtcagtaa 
tggtatatct 
ccctatagtg 
ccacgggtgc 
cttactggtt 
aaaacgtctg 
ctggaaacgc 
tgctggctac 
gtgatttttc 
agtaaccggg 
ggtatcatta 
gaaaaaaccg 
aaactcaacg 
gctgatgagc 
cacatgcagc 
gcccgtcagg 
cgtagcgata 
gagtgcacca 
caggcgctct 
agcggtatca 
aggaaagaac 
gctggcgttt 
tcagaggtgg 
cctcgtgcgc 
ttcgggaagc 
cgttcgctcc 
atccggtaac 
agccactggt 
gtggtggcct 
gccagttacc 
tagcggtggt 
agatcctttg 
gattttggtc 
aagttttaaa 
aatcagtgag 
ccccgtcgtg 
gataccgcga 
aagggccgag 
ttgccgggaa 
tgctgcaggc 
ccaacgatca 
cggtcctccg 
agcactgcat 
gtactcaacc 
gtcaatacgg 
acgttcttcg 
acccactcgt 
agcaaaaaca 
aatactcata 
gagcggatac 



gtatctctga 
cccttccagg 
tctgacccgg 
tcgctcgacc 
cagcgttttc 
tttaaccctg 
aaatgccaga 
gtcgaaatca 
ttcttttcgg 
cccggaccga 
cagaccgcgc 
ccatttccgg 
tttccaggta 
aacctcatca 
attggacatg 
ccttcttaaa 
agtcgtatta 
gcatgatcgt 
agcagaatga 
cgacctgagc 
ggaagtcagc 
cctgtggaac 
tctggtcccg 
catgttcatc 
cccccatgaa 
cccttaacat 
agctggacgc 
tttaccgcag 
tcccggagac 
gcgcgteagc 
gcggagtgta 
tatatgcggt 
tccgcttcct 
gctcactcaa 
atgtgagcaa 
ttccataggc 
cgaaacccga 
tctcctgttc 
gtggcgcttt 
aagctgggct 
tatcgtcttg 
aacaggatta 
aactacggct 
ttcggaaaaa 
ttttttgttt 
atcttttcta 
atgagattat 
tcaatctaaa 
gcacctatct 
tagataacta 
gacccacgct 
cgcagaagtg 
gctagagtaa 
atcgtggtgt 
aggcgagtta 
atcgttgtca 
aattctctta 
aagtcattct 
gataataccg 
gggcgaaaac 
gcacccaact 
ggaaggcaaa 
ctcttccttt 
atatttgaat 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 
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gtatttagaa aaataaacaa 
aaattgtaaa cgttaatatt 
tttttaacca ataggccgaa 
tagggttgag tgttgttcca 
acgtcaaagg gcgaaaaacc 
aatcaagttt tttggggtcg 
cccgatttag agcttgacgg 
cgaaaggagc gggcgctagg 
cacccgccgc gcttaatgcg 



51 

ataggggttc cgcgcacatt 
ttgttaaaat tcgcgttaaa 
atcggcaaaa tcccttataa 
gtttggaaca agagtccact 
gtctatcagg gcgatggccc 
aggtgccgta aagcactaaa 
ggaaagccgg cgaacgtggc 
gcgctggcaa gtgtagcggt 
ccgctacagg gcgcgtccca 



tccccgaaaa gtgccacctg 4260 
tttttgttaa atcagctcat 4320 
atcaaaagaa tagaccgaga 4 380 
attaaagaac gtggactcca 4440 
actacgtgaa ccatcaccct 4500 
tcggaaccct aaagggagcc 4560 
gagaaaggaa gggaagaaag 4 620 
cacgctgcgc gtaaccacca 4 680 
ttcgcca 4727 



<210> 17 
<211> 4488 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-VPCS 

<400> 17 

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 60 

gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 120 

gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 180 

cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 240 

gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 300 

gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 360 

ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 420 

tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 4 80 

ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 540 

cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 600 

accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca 660 

cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 720 

tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 780 

cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 840 

acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 900 

atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 960 

tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 1020 

aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg 1080 

cgtatcacga ggccctttcg tcttcaagaa ttaaaaggat ctaggtgaag atcctttttg 1140 

ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 1200 

tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 1260 

aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 1320 

tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 1380 

agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 1440 

taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 1500 

caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 1560 

agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 1620 

aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 1680 

gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 1740 

tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 1800 

gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 1860 

ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 1920 

ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 1980 

aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 204 0 

accgcatcag atctgatggt gcactctcag tacaatctgc tctgatgccg catagttaag 2100 

ccagtatata cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc 2160 

caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 2220 

ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 2280 

cgaggcccag cgattcgaac ttctgataga cttcgaaatt aatacgactc actataggga 2340 

gaccacaacg gtttccctct agaaataatt ttgtttaact ttaagaagga gatatacata 2400 

tgacctctcg ccgctccgtg aagtcgggtc' cgcgggaggt tccgcgcgat gagtacgagg 24 60 

atctgtacta caccccgtct tcaggtatgg cgagtcccga tagtccgcct gacacctccc 2520 

gccgtggcgc cctacagaca cgctcgcgcc agaggggcga ggtccgtttc gtccagtacg 2580 
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acgagtcgga 
tcccccggac 
cgcctccgcc 
ccccccgaac 
gcggcaggaa 
cgccaacccg 
cccccccaaa 
tctgcgccgc 
gggacatgtc 
tccgcgtgac 
cagacgtggt 
gccccaccga 
agggtaccga 
gtggcggccg 
atgcaacgag 
tttctgagca 
agttgaataa 
atcttcaggc 
tgcttcatcg 
tgcggcggat 
tcgaacgcac 
atatacgtaa 
ttgccaggat 
ttggcagaac 
taactaaact 
acctgttttg 
caactcgcgc 
atgactctgg 
gagatatggc 
atgtaaatat 
gcctgctgga 
tcggtggtta 



ttatgccctc 
gcggcgtccc 
acccgctggg 
ccagcgggtg 
atcggcccag 
atccaagaca 
ccccgacgcg 
ggtcgggcgc 
gcgtccgcgc 
ggtctgcgag 
gcaggacgtc 
gcgacctcga 
gctcggatcc 
catgtccaat 
tgatgaggtt 
tacctggaaa 
ccggaaatgg 
gcgcggtctg 
tcggtccggg 
ccgaaaagaa 
tgatttcgac 
tctggcattt 
cagggttaaa 
gaaaacgctg 
ggtcgagcga 
ccgggtcaga 
cctggaaggg 
tcagagatac 
ccgcgctgga 
tgtcatgaac 
agatggcgat 
ataagcttat 



tacgggggct 
gtttccgggg 
tccggagggg 
gcgtctaagg 
ccagaatccg 
cccgcgcagg 
ccatggaccc 
ctggcggcca 
acagacgaag 
ggcaaaaacc 
gacgcggcca 
gccccagccc 
actagtccag 
ttactgaccg 
cgcaagaacc 
atgcttctgt 
tttcccgcag 
gcagtaaaaa 
ctgccacgac 
aacgttgatg 
caggttcgtt 
ctggggattg 
gatatctcac 
gttagcaccg 
tggatttccg 
aaaaatggtg 
atttttgaag 
ctggcctggt 
gtttcaatac 
tatatccgta 
ggtatcgaag 
cgatgataag 



cgtcttccga 
cggttttgtc 
ccggacgcac 
cccccgcggc 
ccgcactccc 
ggctggccag 
cccgggtggc 
tgcatgcccg 
acctcaacga 
tgcttcagcg 
cggcgactcg 
gctccgcttc 
tgtggtggaa 
tacaccaaaa 
tgatggacat 
ccgtttgccg 
aacctgaaga 
ctatccagca 
caagtgacag 
ccggtgaacg 
cactcatgga 
cttataacac 
gtactgacgg 
caggtgtaga 
tctctggtgt 
ttgccgcgcc 
caactcatcg 
ctggacacag 
cggagatcat 
acctggatag 
gtcgtggtag 
ctgtcaaaca 



agacgacgaa 
cggcccgggg 
acccaccacc 
cccggcggcg 
agacgccccc 
aaagctgcac 
cggctttaac 
gatggcggct 
actccttggc 
cgccaacgag 
agggcgttct 
tcgccccaga 
ttctgcagat 
tttgcctgca 
gttcagggat 
gtcgtgggcg 
tgttcgcgat 
acatttgggc 
caatgctgtt 
tgcaaaacag 
aaatagcgat 
cctgttacgt 
tgggagaatg 
gaaggcactt 
agctgatgat 
atctgccacc 
attgatttac 
tgcccgtgtc 
gcaagctggt 
tgaaacaggg 
cgcttggcgt 
tgagaatt 



cacccggagg 
cctgcgcggg 
gccccccggg 
gagaccaccc 
gcgtcgacgg 
tttagcaccg 
aagcgcgtct 
gtccagctct 
atcaccacca 
ttggtgaatc 
gcggcgtcgc 
cggcccgtcg 
atccagcaca 
ttaccggtcg 
cgccaggcgt 
gcatggtgca 
tatcttctat 
cagctaaaca 
tcactggtta 
gctctagcgt 
cgctgccagg 
atagccgaaa 
ttaatccata 
agcctggggg 
ccgaataact 
agccagctat 
ggcgctaagg 
ggagccgcgc 
ggctggacca 
gcaatggtgc 
cacccgcagt 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4488 



<210> 18 
<211> 1125 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; DNA sequence 
coding for a fusion protein TATcreStrepTag 

<220> 

<221> CDS 

<222> (1) . . (1119) 

<400> 18 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tec 4 8 
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
15 10 15 

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96 
Asn Leu Leu Thr Val His Gin Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 14 4 
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tec gtt tgc egg 192 
Gin Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 
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teg tgg gcg gca tgg tgc aag ttg aat aac egg aaa tgg ttt ccc gca 240 

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

gaa cct gaa gat gtt cgc gat tat ctt eta tat ctt cag gcg cgc ggt 288 

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gin Ala Arg Gly 
85 90 95 

ctg aca gta aaa act ate cag caa cat ttg ggc cag eta aac atg ctt 336 

Leu Thr Val Lys Thr He Gin Gin His Leu Gly Gin Leu Asn Met Leu 

100 105 110 

c * , j 

cat cgt egg tec ggg ctg cca cga cca agt gac age aat get gtt tea 384 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

ctg gtt atg egg egg ate cga aaa gaa aac gtt gat gee ggt gaa cgt 432 

Leu Val Met Arg Arg He Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

gca aaa cag get eta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 4 80 

Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gin Val Arg 
145 150 155 160 

tea etc atg gaa aat age gat cgc tgc cag gat ata cgt aat ctg gca 528 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gin Asp He Arg Asn Leu Ala 
165 170 * J 175 

ttt ctg ggg att get tat aac ace ctg tta cgt ata gee gaa att gee 576 

Phe Leu Gly He Ala Tyr Asn Thr Leu Leu Arg He Ala Glu He Ala 

180 185 190 

agg ate agg gtt aaa gat ate tea cgt act gac ggt ggg aga atg tta 624 

Arg He Arg Val Lys Asp He Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

ate cat att ggc aga acg aaa acg ctg gtt age ace gca ggt gta gag 672 

He His He Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

aag gca ctt age ctg ggg gta act aaa ctg gtc gag cga tgg att tec 720 

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp lie Ser 
225 230 235 240 

gtc tct ggt gta get gat gat ccg aat aac tac ctg ttt tgc egg gtc 768 

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 " 255 

aga aaa aat ggt gtt gee gcg cca tct gee ace age cag eta tea act 816 

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gin Leu Ser Thr 

260 265 270 

cgc gee ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864 

Arg Ala Leu Glu Gly He Phe Glu Ala Thr His Arg Leu He Tyr Gly 
275 280 285 

get aag gat gac tct ggt cag aga tac ctg gee tgg tct gga cac agt 912 

Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

gee cgt gtc gga gee gcg cga gat atg gec cgc get gga gtt tea ata 960 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser He 
305 310 315 " 320 
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ccg gag ate atg caa get ggt ggc tgg acc aat gta aat att gtc atg 1008 
Pro Glu He Met Gin Ala Gly Gly Trp Thr Asn Val Asn He Val Met 
325 330 335 

aac tat ate cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056 
Asn Tyr He Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

ctg gaa gat ggc gat ggt ate gaa ggt cgt ggt age get tgg cgt cac 1104 
Leu Glu Asp Gly Asp Gly He Glu Gly Arg Gly Ser Ala Trp Arg His 
355 360 365 

ccg cag ttc ggt ggt taataa 1125 
Pro Gin Phe Gly Gly 
370 



<210> 19 
<211> 373 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TATcreStrepTag 

<400> 19 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
1 5 10 15 

Asn Leu Leu Thr Val His Gin Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

Gin Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gin Ala Arg Gly 
85 90 " 95 

Leu Thr Val Lys Thr He Gin Gin His Leu Gly Gin Leu Asn Met Leu 
100 105 110 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

Leu Val Met Arg Arg He Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gin Val Arg 
145 150 155 160 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gin Asp He Arg Asn Leu Ala 
165 170 * 175 

Phe Leu Gly He Ala Tyr Asn Thr Leu Leu Arg He Ala Glu He Ala 
180 185 190 

Arg He Arg Val Lys Asp He Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 
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He His He Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp He Ser 
225 230 235 " 240 

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gin Leu Ser Thr 
260 265 270 

Arg Ala Leu Glu Gly He Phe Glu Ala Thr His Arg Leu lie Tyr Gly 
275 280 285 

Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser He 
305 310 315 320 

Pro Glu He Met Gin Ala Gly Gly Trp Thr Asn Val Asn He Val Met 
325 330 335 

Asn Tyr He Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

Leu Glu Asp Gly Asp Gly He Glu Gly Arg Gly Ser Ala Trp Arg His 
355 360 365 

Pro Gin Phe Gly Gly 
370 



<210> 20 
<211> 2055 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag 

<220> 

<221> CDS 

<222> (1) . . (2049) 

<400> 20 

atg acc tct cgc cgc tec gtg aag teg ggt ccg egg gag gtt ccg cgc 48 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
15 10 15 

gat gag tac gag gat ctg tac tac acc ccg tct tea ggt atg gcg agt 96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

ccc gat agt ccg cct gac acc tec cgc cgt ggc gee eta cag aca cgc 144 
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 ~ 45 

teg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag teg gat 192 
Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 55 60 
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tat gcc etc tac ggg ggc teg tct tec gaa gac gac gaa cac ccg gag 240 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

gtc ccc egg acg egg cgt ccc gtt tec ggg gcg gtt ttg tec ggc ccg 288 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

ggg cct gcg egg gcg cct ccg cca ccc get ggg tec gga ggg gcc gga 336 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 " 110 

cgc aca ccc acc acc gcc ccc egg gcc ccc cga acc cag egg gtg gcg 384 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 

115 120 125 

tct aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432 

Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

teg gcc. cag cca gaa tec. gcc gca etc cca gac gcc ccc gcg teg acg 480 

Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

gcg cca acc cga tec aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

cac ttt age acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc egg 576 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 

195 200 205 

gcg gcc atg cat gcc egg atg gcg get gtc cag etc tgg gac atg teg 672 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
210 215 220 

cgt ccg cgc aca gac gaa gac etc aac gaa etc ctt ggc ate acc acc 720 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
225 ■ 230 235 240 

ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 7 68 

lie Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816 

Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

act cga ggg cgt tct gcg gcg teg cgc ccc acc gag cga cct cga gcc 864 

Thr Arg Gly Arg Ser Ala Ala Ser Arg ' Pro Thr Glu Arg Pro Arg Ala 

275 280 285 . 

cca gcc cgc tec get tct cgc ccc aga egg ccc gtc gag ggt acc gag 912 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 960 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Gin His 
305 310 315 320 
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agt ggc ggc cgc atg tec aat tta ctg acc gta cac caa aat ttg cct 1008 
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
325 330 335 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056 
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 . 345 350 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104 
Asp Met Phe Arg Asp Arg Gin Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

. L.J 

ctt ctg tec gtt tgc egg teg tgg gcg gca tgg tgc aag ttg aat aac 1152 
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

egg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt eta 1200 
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 "395 " 400 

tat ctt cag gcg cgc ggt ctg gca gta aaa act ate cag caa cat ttg 1248 
Tyr Leu Gin Ala Arg Gly Leu Ala Val Lys Thr He Gin Gin His Leu 
405 410 415 

ggc cag eta aac atg ctt cat cgt egg tec ggg ctg cca cga cca agt 1296 
Gly Gin Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 430 

gac age aat get gtt tea ctg gtt atg egg egg ate cga aaa gaa aac 1344 
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg He Arg Lys Glu Asn 
435 440 445 

gtt gat gee ggt gaa cgt gca aaa cag get eta gcg ttc gaa cgc act 1392 
Val Asp Ala Gly Glu Arg Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr 
450 455 460 

gat ttc gac cag gtt cgt tea etc atg gaa aat age gat cgc tgc cag 1440 
Asp Phe Asp Gin Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gin 
465 470 475 480 

gat ata cgt aat ctg gca ttt ctg ggg att get tat aac acc ctg tta 1488 
Asp He Arg Asn Leu Ala Phe Leu Gly He Ala Tyr Asn Thr Leu Leu 
485 490 495 

cgt ata gee gaa att gec agg ate agg gtt aaa gat ate tea cgt act 1536 
Arg He Ala Glu He Ala Arg He Arg Val Lys Asp He Ser Arg Thr 
500 505 " * 510 

gac ggt ggg aga atg tta ate cat att ggc aga acg aaa acg ctg gtt 1584 
Asp Gly Gly Arg Met Leu He His He Gly Arg Thr Lys Thr Leu Val 
515 520 525 

age acc gca ggt gta gag aag gca ctt age ctg ggg gta act aaa ctg 1632 
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 

gtc gag cga tgg att tec gtc tct ggt gta get gat gat ccg aat aac 1680 
Val Glu Arg Trp He Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 560 

tac ctg ttt tgc egg gtc aga aaa aat ggt gtt gee gcg cca tct gec 1728 
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
565 570 575 
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acc age cag eta tea act cgc gee ctg gaa ggg att ttt gaa gca act 177 6 

Thr Ser Gin Leu Ser Thr Arg Ala Leu Glu Gly He Phe Glu Ala Thr 
580 585 590 

cat cga ttg att tac ggc get aag gat gac tct ggt cag aga tac ctg 1824 
His Arg Leu He Tyr Gly Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu 
595 600 605 

gec tgg tct gga cac agt gec cgt gtc gga gec gcg cga gat atg gec 1872 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 

610 615 620 

J _j 

cgc get gga gtt tea ata ccg gag ate atg caa get ggt ggc tgg acc 1920 

Arg Ala Gly Val Ser He Pro Glu He Met Gin Ala Gly Gly Trp Thr 

625 630 635 640 

aat gta aat att gtc atg aac tat ate cgt aac ctg gat agt gaa aca 1968 
Asn Val Asn lie Val Met Asn Tyr He Arg Asn Leu Asp Ser Glu Thr 
645 650 " 655 

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt ate gaa ggt cgt 2016 
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly He Glu Gly Arg 
660 665 670 

ggt age get tgg cgt cac ccg cag ttc ggt ggt taataa 2055 
Gly Ser Ala Trp Arg His Pro Gin Phe Gly Gly 
675 680 

<210> 21 
<211> 683 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag 

<400> 21 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
.50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 HO/"" 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 
115 120 125 

Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 
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Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 .._ 

J f 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp ' Met Ser 
210 215 220 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
225 230 235 240 

He Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp He Gin His 
305 310 315 320 

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
325 330 335 

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

Asp Met Phe Arg Asp Arg Gin Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

Tyr Leu Gin Ala Arg Gly Leu Ala Val Lys . Thr He Gin Gin His Leu 
405 410 415 

Gly Gin Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 "* 430 

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg He Arg Lys Glu Asn 
435 440 445 

Val Asp Ala Gly Glu Arg Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr 
450 455 460 



Asp Phe Asp Gin Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gin 
465 470 475 480 
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Asp lie Arg Asn Leu Ala Phe Leu Gly lie Ala Tyr Asn Thr Leu Leu 
485 490 495 

Arg He Ala Glu He Ala Arg lie Arg Val Lys Asp He Ser Arg Thr 
500 505 510 

Asp Gly Gly Arg Met Leu He His He Gly Arg Thr Lys Thr Leu Val 
515 520 525 

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 

i 

Val Glu Arg Trp He Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 " 560 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
565 570 575 

Thr Ser Gin Leu Ser Thr Arg Ala Leu Glu Gly He Phe Glu Ala Thr 
580 585 590 

His Arg Leu He Tyr Gly Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu 
595 600 605 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

Arg Ala Gly Val Ser He Pro Glu He Met Gin Ala Gly Gly Trp Thr 
625 630 635 640 

Asn Val Asn He Val Met Asn Tyr He Arg Asn Leu Asp Ser Glu Thr 
645 650 655 

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly He Glu Gly Arg 
660 665 670 



Gly Ser Ala Trp Arg His Pro Gin Phe Gly Gly 
675 680 



<210> 22 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic TAT 
protein 

<400> 22 

Ala Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg 
1 5 10 



<210> 23 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 
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<400> 23 

Tyr Ma Arg Lys Ala Arg Arg Gin Ala Arg Arg 
15 10 



<210> 24 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 24 

Tyr Ala Arg Ala Ala Ala Arg Gin Ala Arg Ala 
15 10 



<210> 25 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 25 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Arg 
15 10 



<210> 26 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 26 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Ala 
15 10 



<210> 27 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 27 

Tyr Ala Arg Arg Arg Arg Arg Arg Arg Arg Arg 
1 5 10 



<210> 28 
<211> 11 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 28 

Tyr Ala Ala Ala Arg Arg Arg Arg Arg Arg Arg 
15 10 



<210> 29 

<211> 4960 - ' 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCMV-I-Cre-pA 

<400> 29 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 

tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 

cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 

atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 

tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300 

cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360 

tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 

cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480 

ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540 

aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 

gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 

attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720 

tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780 

cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840 

ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900 

gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960 

cagggcggcc tcgaccatgc ccaagaagaa gaggaaggtg tccaatttac tgaccgtaca 1020 

ccaaaatttg cctgcattac cggtcgatgc aacgagtgat gaggttcgca agaacctgat 1080 

ggacatgttc agggatcgcc aggcgttttc tgagcatacc tggaaaatgc ttctgtccgt 1140 

ttgccggtcg tgggcggcat ggtgcaagtt gaataaccgg aaatggtttc ccgcagaacc 1200 

tgaagatgtt cgcgattatc ttctatatct tcaggcgcgc ggtctggcag taaaaactat 1260 

ccagcaacat ttgggccagc taaacatgct tcatcgtcgg tccgggctgc cacgaccaag 1320 

tgacagcaat gctgtttcac tggttatgcg gcggatccga aaagaaaacg ttgatgccgg 1380 

tgaacgtgca aaacaggctc tagcgttcga acgcactgat ttcgaccagg ttcgttcact 1440 

catggaaaat agcgatcgct gccaggatat acgtaatctg gcatttctgg ggattgctta 1500 

taacaccctg ttacgtatag ccgaaattgc caggatcagg gttaaagata tctcacgtac 1560 

tgacggtggg agaatgttaa tccatattgg cagaacgaaa acgctggtta gcaccgcagg 1620 

tgtagagaag gcacttagcc tgggggtaac taaactggtc gagcgatgga tttccgtctc 1680 

tggtgtagct gatgatccga ataactacct gttttgccgg gtcagaaaaa atggtgttgc 1740 

cgcgccatct gccaccagcc agctatcaac tcgcgccctg gaagggattt ttgaagcaac ' 1800 

tcatcgattg atttacggcg ctaaggatga ctctggtcag agatacctgg cctggtctgg 1860 

acacagtgcc cgtgtcggag ccgcgcgaga tatggcccgc gctggagttt caataccgga 1920 

gatcatgcaa gctggtggct ggaccaatgt aaatattgtc atgaactata tccgtaacct 1980 

ggatagtgaa acaggggcaa tggtgcgcct gctggaagat ggcgattagc cattaacgcg 2040 

taaatgattg cagatccact agttctaggg ccgcgtcgac ctcgagatcc aggcgcggat 2100 

caataaaaga tcattatttt caatagatct gtgtgttggt tttttgtgtg ccttggggga 2160 

gggggaggcc agaatgaggc gcggccaagg gggaggggga ggccagaatg accttggggg 2220 

agggggaggc cagaatgacc ttgggggagg gggaggccag aatgaggcgc gcccccgggt 2280 

accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 2340 

tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 24 00 

ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 24 60 

gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 2520 

tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga 2580 
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cgcgccctga cgggcttgtc 
cgggagctgc atgtgtcaga 
cctcgtgata cgcctatttt 
aggtggcact tttcggggaa 
ttcaaatatg tatccgctca 
aaggaagagt atgagtattc 
ttgccttcct gtttttgctc 
gttgggtgca cgagtgggtt 
ttttcgcccc gaagaacgtt 
ggtattatcc cgtattgacg 
gaatgacttg gttgagtact 
aagagaatta tgcagtgctg 
gacaacgatc ggaggaccga 
aactcgcctt gatcgttggg 
caccacgatg cctgtagcaa 
tactctagct tcccggcaac 
acttctgcgc tcggcccttc 
gcgtgggtct cgcggtatca 
agttatctac acgacgggga 
gataggtgcc tcactgatta 
ttagattgat ttaaaacttc 
taatctcatg accaaaatcc 
agaaaagatc aaaggatctt 
aacaaaaaaa ccaccgctac 
ttttccgaag gtaactggct 
gccgtagtta ggccaccact 
aatcctgtta ccagtggctg 
aagacgatag ttaccggata 
gcccagcttg gagcgaacga 
aagcgccacg cttcccgaag 
aacaggagag cgcacgaggg 
cgggtttcgc cacctctgac 
cctatggaaa aacgccagca 
tgctcacatg ttctttcctg 
tgagtgagct gataccgctc 
ggaagcggaa gagcgcccaa 
atgcagctgg cacgacaggt 
tgtgagttag ctcactcatt 
gttgtgtgga attgtgagcg 
cgccaagcta gcccgggcta 



63 

tgctcccggc atccgcttac 
ggttttcacc gtcatcaccg 
tataggttaa tgtcatgata 
atgtgcgcgg aacccctatt 
tgagacaata accctgataa 
aacatttccg tgtcgccctt. 
acccagaaac gctggtgaaa 
acatcgaact ggatctcaac 
ttccaatgat gagcactttt 
ccgggcaaga gcaactcggt 
caccagtcac agaaaagcat 
ccataaccat gagtgataac 
aggagctaac cgcttttttg 
aaccggagct gaatgaagcc 
tggcaacaac gttgcgcaaa 
aattaataga ctggatggag 
cggctggctg gtttattgct 
ttgcagcact ggggccagat 
gtcaggcaac tatggatgaa 
agcattggta actgtcagac 
atttttaatt taaaaggatc 
cttaacgtga gttttcgttc 
cttgagatcc tttttttctg 
cagcggtggt ttgtttgccg 
tcagcagagc gcagatacca 
tcaagaactc tgtagcaccg 
ctgccagtgg cgataagtcg 
aggcgcagcg gtcgggctga 
cctacaccga actgagatac 
ggagaaaggc ggacaggtat 
agcttccagg gggaaacgcc 
ttgagcgtcg atttttgtga 
acgcggcctt tttacggttc 
cgttatcccc tgattctgtg 
gccgcagccg aacgaccgag 
tacgcaaacc gcctctcccc 
ttcccgactg gaaagcgggc 
aggcacccca ggctttacac 
gataacaatt tcacacagga 
gcttgcatgc ctgcaggttt 



agacaagctg tgaccgtctc 2640 
aaacgcgcga gacgaaaggg 2700 
ataatggttt cttagacgtc 2760 
tgtttatttt tctaaataca 2820 
atgcttcaat aatattgaaa 2880 
attccctttt ttgcggcatt 2940 
gtaaaagatg ctgaagatca 3000 
agcggtaaga tccttgagag 3060 
aaagttctgc tatgtggcgc 3120 
cgccgcatac actattctca 3180 
cttacggatg gcatgacagt 3240 
actgcggcca, acttacttct 3300 
cacaacatgg gggatcatgt 3360 
ataccaaacg acgagcgtga 3420 
ctattaactg gcgaactact 3480 
gcggataaag ttgcaggacc 3540 
gataaatctg gagccggtga 3600 
ggtaagccct cccgtatcgt 3660 
cgaaatagac agatcgctga 3720 
caagtttact catatatact 3780 
taggtgaaga tcctttttga 3840 
cactgagcgt cagaccccgt 3900 
cgcgtaatct gctgcttgca 3960 
gatcaagagc taccaactct 4020 
aatactgtcc ttctagtgta 4080 
cctacatacc tcgctctgct 4140 
tgtcttaccg ggttggactc 4200 
acggggggtt cgtgcacaca 4260 
ctacagcgtg agctatgaga 4320 
ccggtaagcg gcagggtcgg 4380 
tggtatcttt atagtcctgt 4440 
tgctcgtcag gggggcggag 4500 
ctggcctttt gctggccttt 4560 
gataaccgta ttaccgcctt 4 620 
cgcagcgagt cagtgagcga 4 680 
gcgcgttggc cgattcatta 4740 
agtgagcgca acgcaattaa 4 800 
tttatgcttc cggctcgtat 48 60 
aacagctatg accatgatta 4 920 

4960 



<210> 30 
<211> 7332 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCMV-I-beta-pA 



<400> 30 

aaacagtccg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 

gtctatataa 

attaatacga 



atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 



cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 



cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gctgactcta 



gattattgac 
tggagttccg 
cccgcccatt 
attgacgtca 
atcatatgcc 
atgcccagta 
tcgctattac 
actcacgggg 
aaaatcaacg 
gtaggcgtgt 
ctgcttactg 
gacttaatta 



tagttattaa 60 
cgttacataa 120 
gacgtcaata 180 
atgggtggac 240 
aagtacgccc 300 
catgacctta 360 
catggtgatg 420 
atttccaagt 480 
ggactttcca 540 
acggtgggag 600 
gcttatcgaa 660 
agcgttgggg 720 
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tgagtactcc ctctcaaaag cgggcatgac 
cgaggaggat ttgatattca cctggcccgc 
ctggtcagaa aagacaatct ttttgttgtc 
gccatacact tgagtgacat tgacatccac 
cagggcggcc gcaattcccg gggatcgaaa 
tgtcgtttac tttgaccaac aagaacgtga 
tggacaccag caaggagctg ctcaagcgcg 
aaaaccctgg cgttacccaa cttaatcgcc 
gtaatagcga agaggcccgc accgatcgcc 
aatggcgctt tgcctggttt ccggcaccag 
atcttcctga ggccgatact gtcgtcgtcc 
cgcccatcta caccaacgta acctatccca 
agaatccgac gggttgttac tcgctcacat 
gccagacgcg aattattttt gatggcgtta 
gctgggtcgg ttacggccag gacagtcgtt 
tacgcgccgg agaaaaccgc ctcgcggtga 
tggaagatca ggatatgtgg cggatgagcg 
aaccgactac acaaatcagc gatttccatg 
gcgctgtact ggaggctgaa gttcagatgt 
cagtttcttt atggcagggt gaaacgcagg 
aaattatcga tgagcgtggt ggttatgccg 
acccgaaact gtggagcgcc gaaatcccga 
ccgccgacgg cacgctgatt gaagcagaag 
ttgaaaatgg tctgctgctg ctgaacggca 
acgagcatca tcctctgcat ggtcaggtca 
tgctgatgaa gcagaacaac tttaacgccg 
tgtggtacac gctgtgcgac cgctacggcc 
cccacggcat ggtgccaatg aatcgtctga 
gcgaacgcgt aacgcgaatg gtgcagcgcg 
cgctggggaa tgaatcaggc cacggcgcta 
ctgtcgatcc ttcccgcccg gtgcagtatg 
atattatttg cccgatgtac gcgcgcgtgg 
aatggtccat caaaaaatgg ctttcgctac 
aatacgccca cgcgatgggt aacagtcttg 
gtcagtatcc ccgtttacag ggcggcttcg 
aatatgatga aaacggcaac ccgtggtcgg 
acgatcgcca gttctgtatg aacggtctgg 
tgacggaagc aaaacaccag cagcagtttt 
aagtgaccag cgaatacctg ttccgtcata 
cgctggatgg taagccgctg gcaagcggtg 
aacagttgat tgaactgcct gaactaccgc 
cagtacgcgt agtgcaaccg aacgcgaccg 
ggcagcagtg gcgtctggcg gaaaacctca 
tcccgcatct gaccaccagc gaaatggatt 
aatttaaccg ccagtcaggc tttctttcac 
tgacgccgct gcgcgatcag ttcacccgtg 
aagcgacccg cattgaccct aacgcctggg 
aggccgaagc agcgttgttg cagtgcacgg 
cgaccgctca cgcgtggcag catcagggga 
ggattgatgg tagtggtcaa atggcgatta 
cgcatccggc gcggattggc ctgaactgcc 
ggctcggatt agggccgcaa gaaaactatc 
gctgggatct gccattgtca gacatgtata 
tgcgctgcgg gacgcgcgaa ttgaattatg 
tcaacatcag ccgctacagt caacagcaac 
acgcggaaga aggcacatgg ctgaatatcg 
actcctggag cccgtcagta tcggcggaat 
agttggtctg gtgtcaaaaa taataataac 
taaggaaatc cattatgtac tatttaaaaa 
ttttctttta cttttttatc atgggagcct 
acatcaacca tatcagcaaa agtgatacgg 
cgctattatt ccaaccgctg tttggtctgc 
ggccgcgtcg acctcgagat ccaggcgcgg 
ctgtgtgttg gttttttgtg tgccttgggg 



64 

ttctgcgcta agattgtcag tttccaaaaa 780 
ggtgatgcct ttgagggtgg ccgcgtccat 84 0 
aagcttgagg tgtggcaggc ttgagatctg 900 
tttgcctttc tctccacagg tgtccactcc 960 
gagcctgcta aagcaaaaaa gaagtcacca 1020 
ttttcgttgc cggtctggga ggcattggtc 1080 
atcccgtcgt tttacaacgt cgtgactggg 1140 
ttgcagcaca tccccctttc gccagctggc 1200 
cttcccaaca gttgcgcagc ctgaatggcg 1260 
aagcggtgcc ggaaagctgg ctggagtgcg 1320 
cctcaaactg gcagatgcac ggttacgatg 1380 
ttacggtcaa tccgccgttt gttcccacgg 1440 
ttaatgttga tgaaagctgg ctacaggaag 1500 
actcggcgtt tcatctgtgg tgcaacgggc 1560 
tgccgtctga atttgacctg agcgcatttt 1620 
tggtgctgcg ttggagtgac ggcagttatc 1680 
gcattttccg tgacgtctcg ttgctgcata 1740 
ttgccactcg ctttaatgat gatttcagcc 1800 
gcggcgagtt gcgtgactac ctacgggtaa 1860 
tcgccagcgg caccgcgcct ttcggcggtg 1920 
atcgcgtcac actacgtctg aacgtcgaaa 1980 
atctctatcg tgcggtggtt gaactgcaca 2040 
cctgcgatgt cggtttccgc gaggtgcgga 2100 
agccgttgct gattcgaggc gttaaccgtc 2160 
tggatgagca gacgatggtg caggatatcc 2220 
tgcgctgttc gcattatccg aaccatccgc 2280 
tgtatgtggt ggatgaagcc aatattgaaa 2340 
ccgatgatcc gcgctggcta ccggcgatga 2400 
atcgtaatca cccgagtgtg atcatctggt 24 60 
atcacgacgc gctgtatcgc tggatcaaat 2520 
aaggcggcgg agccgacacc acggccaccg 2580 
atgaagacca gcccttcccg gctgtgccga 2640 
ctggagagac gcgcccgctg atcctttgcg 2700 
gcggtttcgc taaatactgg caggcgtttc 27 60 
tctgggactg ggtggatcag tcgctgatta 2820 
cttacggcgg tgattttggc gatacgccga 2880 
tctttgccga ccgcacgccg catccagcgc 2940 
tccagttccg tttatccggg caaaccatcg 3000 
gcgataacga gctcctgcac tggatggtgg 3060 
aagtgcctct ggatgtcgct ccacaaggta 3120 
agccggagag cgccgggcaa ctctggctca 3180 
catggtcaga agccgggcac atcagcgcct 3240 
gtgtgacgct ccccgccgcg tcccacgcca 3300 
tttgcatcga gctgggtaat aagcgttggc 3360 
agatgtggat tggcgataaa aaacaactgc 3420 
caccgctgga taacgacatt ggcgtaagtg 3480 
tcgaacgctg gaaggcggcg ggccattacc 3540 
cagatacact tgctgatgcg gtgctgatta 3600 
aaaccttatt tatcagccgg aaaacctacc 3660 
ccgttgatgt tgaagtggcg agcgatacac 3720 
agctggcgca ggtagcagag cgggtaaact 3780 
ccgaccgcct tactgccgcc tgttttgacc 3840 
ccccgtacgt cttcccgagc gaaaacggtc 3900 
gcccacacca gtggcgcggc gacttccagt 3960 
tgatggaaac cagccatcgc catctgctgc 4020 
acggtttcca tatggggatt ggtggcgacg 4080 
tacagctgag cgccggtcgc taccattacc 4140 
cgggcaggcc atgtctgccc gtatttcgcg 4200 
acacaaactt ttggatgttc ggtttattct 4260 
acttcccgtt tttcccgatt tggctacatg 4320 
gtattatttt tgccgctatt tctctgttct 4 380 
' tttctgacaa actcggcctc gactctaggc 4440 
atcaataaaa gatcattatt ttcaatagat 4500 
gagggggagg ccagaatgag gcgcggccaa 4560 
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gggggagggg 
gggggaggcc 
ttacaacgtc 
ccccctttcg 
ttgcgcagcc 
ggtatttcac 
agccagcccc 
gcatccgctt 
ccgtcatcac 
aatgtcatga 
ggaaccccta 
taaccctgat 
cgtgtcgccc 
acgctggtga 
ctggatctca 
atgagcactt 
gagcaactcg 
acagaaaagc 
atgagtgata 
accgcttttt 
ctgaatgaag 
acgttgcgca 
gactggatgg 
tggtttattg 
ctggggccag 
actatggatg 
taactgtcag 
tttaaaagga 
gagttttcgt 
cctttttttc 
gtttgtttgc 
gcgcagatac 
tctgtagcac 
ggcgataagt 
cggtcgggct 
gaactgagat 
gcggacaggt 
gggggaaacg 
cgatttttgt 
tttttacggt 
cctgattctg 
cgaacgaccg 
ccgcctctcc 
tggaaagcgg 
caggctttac 
tttcacacag 
gcctgcaggt 



gaggccagaa 
agaatgaggc 
gtgactggga 
ccagctggcg 
tgaatggcga 
accgcatatg 
gacacccgcc 
acagacaagc 
cgaaacgcgc 
taataatggt 
tttgtttatt 
aaatgcttca 
ttattccctt 
aagtaaaaga 
acagcggtaa 
ttaaagttct 
gtcgccgcat 
atcttacgga 
acactgcggc 
tgcacaacat 
ccataccaaa 
aactattaac 
aggcggataa 
ctgataaatc 
atggtaagcc 
aacgaaatag 
accaagttta 
tctaggtgaa 
tccactgagc 
tgcgcgtaat 
cggatcaaga 
caaatactgt 
cgcctacata 
cgtgtcttac 
gaacgggggg 
acctacagcg 
atccggtaag 
cctggtatct 
gatgctcgtc 
tcctggcctt 
tggataaccg 
agcgcagcga 
ccgcgcgttg 
gcagtgagcg 
actttatgct 
gaaacagcta 
tt 



tgaccttggg 
gcgcccccgg 
aaaccctggc 
taatagcgaa 
atggcgcctg 
gtgcactctc 
aacacccgct 
tgtgaccgtc 
gagacgaaag 
ttcttagacg 
tttctaaata 
ataatattga 
ttttgcggca 
tgctgaagat 
gatccttgag 
gctatgtggc 
acactattct 
tggcatgaca 
caacttactt 
gggggatcat 
cgacgagcgt 
tggcgaacta 
agttgcagga 
tggagccggt 
ctcccgtatc 
acagatcgct 
ctcatatata 
gatccttttt 
gtcagacccc 
ctgctgcttg 
gctaccaact 
ccttctagtg 
cctcgctctg 
cgggttggac 
ttcgtgcaca 
tgagctatga 
cggcagggtc 
ttatagtcct 
aggggggcgg 
ttgctggcct 
tattaccgcc 
gtcagtgagc 
gccgattcat 
caacgcaatt 
tccggctcgt 
tgaccatgat 



ggagggggag 
gtaccgagct 
gttacccaac 
gaggcccgca 
atgcggtatt 
agtacaatct 
gacgcgccct 
tccgggagct 
ggcctcgtga 
tcaggtggca 
cattcaaata 
aaaaggaaga 
ttttgccttc 
cagttgggtg 
agttttcgcc 
gcggtattat 
cagaatgact 
gtaagagaat 
ctgacaacga 
gtaactcgcc 
gacaccacga 
cttactctag 
ccacttctgc 
gagcgtgggt 
gtagttatct 
gagataggtg 
ctttagattg 
gataatctca 
gtagaaaaga 
caaacaaaaa 
ctttttccga 
tagccgtagt 
ctaatcctgt 
tcaagacgat 
cagcccagct 
gaaagcgcca 
ggaacaggag 
gtcgggtttc 
agcctatgga 
tttgctcaca 
tttgagtgag 
gaggaagcgg 
taatgcagct 
aatgtgagtt 
atgttgtgtg 
tacgccaagc 



gccagaatga 
cgaattcact 
ttaatcgcct 
ccgatcgccc 
ttctccttac 
gctctgatgc 
gacgggcttg 
gcatgtgtca 
tacgcctatt 
cttttcgggg 
tgtatccgct 
gtatgagtat 
ctgtttttgc 
cacgagtggg 
ccgaagaacg 
cccgtattga 
tggttgagta 
tatgcagtgc 
tcggaggacc 
ttgatcgttg 
tgcctgtagc 
cttcccggca 
gctcggccct 
ctcgcggtat 
acacgacggg 
cctcactgat 
atttaaaact 
tgaccaaaat 
tcaaaggatc 
aaccaccgct 
aggtaactgg 
taggccacca 
taccagtggc 
agttaccgga 
tggagcgaac 
cgcttcccga 
agcgcacgag 
gccacctctg 
aaaacgccag 
tgttctttcc 
ctgataccgc 
aagagcgccc 
ggcacgacag 
agctcactca 
gaattgtgag 
tagcccgggc 



ccttggggga 
ggccgtcgtt 
tgcagcacat 
ttcccaacag 
gcatctgtgc 
cgcatagtta 
tctgctcccg 
gaggttttca 
tttataggtt 
aaatgtgcgc 
catgagacaa 
tcaacatttc 
tcacccagaa 
ttacatcgaa 
ttttccaatg 
cgccgggcaa 
ctcaccagtc 
tgccataacc 
gaaggagcta 
ggaaccggag 
aatggcaaca 
acaattaata 
tccggctggc 
cattgcagca 
gagtcaggca 
taagcattgg 
tcatttttaa 
cccttaacgt 
ttcttgagat 
accagcggtg 
cttcagcaga 
cttcaagaac 
tgctgccagt 
taaggcgcag 
gacctacacc 
agggagaaag 
ggagcttcca 
acttgagcgt 
caacgcggcc 
tgcgttatcc 
tcgccgcagc 
aatacgcaaa 
gtttcccgac 
ttaggcaccc 
cggataacaa 
tagcttgcat 



4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7332 



<210> 31 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

atgccatggg ctacggccgc aagaagcgcc gccaacgccg ccgcggcatg tccaatttac 60 
tgaccgtaca cc 72 



<210> 32 
<211> 25 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

tttcggatcc gccgcataac cagtg 



<210> 33 
<211> 34 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 33 

tatatctaga ccatgggcta cggccgcaag aagc 

<210> 34 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 34 

gctaccacga ccttcgatac catcgccatc ttccagcagg cgc 

<210> 35 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

taactagcgg ccgcatgtcc aatttactga ccgtacac 

<210> 36 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

tcgagcggcc gccatcgcca tcttccagca ggcg 

<210> 37 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: primer 
<400> 37 

tatatctaga catatgacct ctcgccgctc eg 



<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

ttccgaagac gacgaaacac c 21 

<210> 39 
<211> 32 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

tatattcgaa gcttattaac caccgaactg eg 32 



<210> 40 
<211> 4847 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pGK-cre-pA 

<400> 40 

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 60 
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 120 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgeggcatt 180 
ttgccttcct gtttttgetc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 240 
gttgggtgca cgagtgggtt acatcgaact ggatctcaac ageggtaaga tccttgagag 300 
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 360 
ggtattatcc cgtattgacg cegggcaaga geaacteggt cgccgcatac actattctca 420 
gaatgacttg gttgagtact caccagtcac agaaaagcat ettaeggatg gcatgacagt 480 
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 540 
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 600 
aactcgcctt gatcgttggg aaceggaget gaatgaagee ataccaaacg acgagcgtga 660 
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 720 
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgeaggace 780 
acttctgcgc tcggcccttc cggctggctg gtttattget gataaatctg gagccggtga 840 
gcgtgggtct cgeggtatea ttgeagcact ggggecagat ggtaagcect cccgtatcgt 900 
agttatctac aegaegggga gtcaggcaac tatggatgaa cgaaatagac agategctga 960 
gataggtgee tcactgatta agcattggta actgtcagac caagtttact catatatact 1020 
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 1080 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagegt cagaccccgt 114 0 
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgegtaatet getgettgea 1200 
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 1260 
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 1320 
geegtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 1380 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 1440 
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aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tgagtgagct 
ggaagcggaa 
atgcagctgg 
tgtgagttag 
gttgtgtgga 
cgccaagcgc 
cgaggtcgac 
cccaaggcag 
ggcctctggc 
gtggcccctt 
gcgtcgtgca 
agcaccgctg 
tccttcgctt 
gggctcaggg 
aaaagcgcac 
gctcgaggtc 
aaaatttgcc 
acatgttcag 
gccggtcgtg 
aagatgttcg 
agcaacattt 
acagcaatgc 
aacgtgcaaa 
tggaaaatag 
acaccctgtt 
acggtgggag 
tagagaaggc 
gtgtagctga 
cgccatctgc 
atcgattgat 
acagtgcccg 
tcatgcaagc 
atagtgaaac 
aatgattgca 
tgccagccat 
cccactgtcc 
tctattctgg 
aggcatgctg 
tcgagatcca 
ttcgccctat 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 
cagcgtgacc 
ctttctcgcc 
gttccgattt 
acgtagtggg 
ctttaatagt 
ttttgattta 
acaaaaattt 



ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
gataccgctc 
gagcgcccaa 
cacgacaggt 
ctcactcatt 
attgtgagcg 
gcaattaacc 
ggtatcgata 
tctggagcat 
ctcgcacaca 
cgcgccactt 
ggacgtgaca 
agcaatggaa 
tctgggctca 
gcgggcgggc 
gtctgccgcg 
gaccatgccc 
tgcattaccg 
ggatcgccag 
ggcggcatgg 
cgattatctt 
gggccagcta 
tgtttcactg 
acaggctcta 
cgatcgctgc 
acgtatagcc 
aatgttaatc 
acttagcctg 
tgatccgaat 
caccagccag 
ttacggcgct 
tgtcggagcc 
tggtggctgg 
aggggcaatg 
gatccactag 
ctgttgtttg 
tttcctaata 
ggggtggggt 
gggatgcggt 
ctagttctag 
agtgagtcgt 
cctggcgtta 
agcgaagagg 
gacgcgccct 
gctacacttg 
acgttcgccg 
agtgctttac 
ccatcgccct 
ggactcttgt 
taagggattt 
aacgcgaatt 



aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
gccgcagccg 
tacgcaaacc 
ttcccgactg 
aggcacccca 
gataacaatt 
ctcactaaag 
agcttgatat 
gcgctttagc 
ttccacatcc 
ctactcctcc 
aatggaagta 
gcgggtaggc 
gaggctggga 
gcccgaaggt 
ctgttctcct 
aagaagaaga 
gtcgatgcaa 
gcgttttctg 
tgcaagttga 
ctatatcttc 
aacatgcttc 
gttatgcggc 
gcgttcgaac 
caggatatac 
gaaattgcca 
catattggca 
ggggtaacta 
aactacctgt 
ctatcaactc 
aaggatgact 
gcgcgagata 
accaatgtaa 
gtgcgcctgc 
ttctagagct 
cccctccccc 
aaatgaggaa 
ggggcaggac 
gggctctatg 
cctcgaggct 
attacgcgcg 
cccaacttaa 
cccgcaccga 
gtagcggcgc 
ccagcgccct 
gctttccccg 
ggcacctcga 
gatagacggt 
tccaaactgg 
tgccgatttc 
ttaacaaaat 



gtcgggctga 
actgagatac 
ggacaggtat 
gggaaacgcc 
atttttgtga 
tttacggttc 
tgattctgtg 
aacgaccgag 
gcctctcccc 
gaaagcgggc 
ggctttacac 
tcacacagga 
ggaacaaaag 
cgaattctac 
agccccgctg 
accggtagcg 
cctagtcagg 
gcacgtctca 
ctttggggca 
aggggtgggt 
cctcccgagg 
cttcctcatc 
ggaaggtgtc 
cgagtgatga 
agcatacctg 
ataaccggaa 
aggcgcgcgg 
atcgtcggtc 
ggatccgaaa 
gcactgattt 
gtaatctggc 
ggatcagggt 
gaacgaaaac 
aactggtcga 
tttgccgggt 
gcgccctgga 
ctggtcagag 
tggcccgcgc 
atattgtcat 
tggaagatgg 
cgctgatcag 
gtgccttcct 
attgcatcgc 
agcaaggggg 
gcttctgagn 
agagcggccg 
ctcactggcc 
tcgccttgca 
tcgcccttcc 
attaagcgcg 
agcgcccgct 
tcaagctcta 
ccccaaaaaa 
ttttcgccct 
aacaacactc 
ggcctattgg 
attaacgctt 



acggggggtt 
ctacagcgtg 
ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
gataaccgta 
cgcagcgagt 
gcgcgttggc 
agtgagcgca 
tttatgcttc 
aacagctatg 
ctgggtaccg 
cgggtagggg 
gcacttggcg 
ccaaccggct 
aagtttcccc 
ctagtctcgt 
gcggccaata 
ccgggggcgg 
cccggcattc 
tccgggcctt 
caatttactg 
ggttcgcaag 
gaaaatgctt 
atggtttccc 
tctggcagta 
cgggctgcca 
agaaaacgtt 
cgaccaggtt 
atttctgggg 
taaagatatc 
gctggttagc 
gcgatggatt 
cagaaaaaat 
agggattttt 
atacctggcc 
tggagtttca 
gaactatatc 
cgattagcca 
cctcgactgt 
tgaccctgga 
attgtctgag 
aggattggga 
nngaaagaac 
ccaccgcggt 
gtcgttttac 
gcacatcccc 
caacagttgc 
gcgggtgtgg 
cctttcgctt 
aatcgggggc 
cttgattagg 
ttgacgttgg 
aaccctatct 
ttaaaaaatg 
acaattt 



cgtgcacaca 
agctatgaga 
gcagggtcgg 
atagtcctgt 
gggggcggag 
gctggccttt 
ttaccgcctt 
cagtgagcga 
cgattcatta 
acgcaattaa 
cggctcgtat 
accatgatta 
ggccccccct 
aggcgctttt 
ctacacaagt 
ccgttctttg 
cagcaagctc 
gcagatggac 
gcagctttgt 
gctcaggggc 
tgcacgcttc 
tcgacctgca 
accgtacacc 
aacctgatgg 
ctgtccgttt 
gcagaacctg 
aaaactatcc 
cgaccaagtg 
gatgccggtg 
cgttcactca 
attgcttata 
tcacgtactg 
accgcaggtg 
tccgtctctg 
ggtgttgccg 
gaagcaactc 
tggtctggac 
ataccggaga 
cgtaacctgg 
ttaacgcgta 
gccttctagt 
aggtgccact 
taggtgtcat 
agacaatagc 
cagctggggc 
ggagctccaa 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 
tggttacgcg 
tcttcccttc 
tccctttagg 
gtgatggttc 
agtccacgtt 
cggtctattc 
agctgattta 

I. / 



1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780. 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4847 



<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

catctccggg cctttcgacc tg 22 

<210> 42 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

gcgatcggtg cgggcctctt c 21 
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Claims 

1. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in an endogenous gene. 

...i 

2. The use of claim 1, wherein the PTD is not derived from Antennapedia 
and preferably is a PTD derived from the VP22 protein of HSV or from the 
TAT protein of HIV. 

3. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) being not derived from 
Antennapedia and preferably being derived from the VP22 protein of HSV 
or from the TAT protein of HIV 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in its genome. 

4. The use of claim 3, wherein the recognition sites for said site specific 
recombinase is present within an endogenous gene or a transgene. 

5. The use of any one of claims 2 to 4, wherein the TAT protein comprises 
(i) the amino acid sequence YGRKKRRQRRR (SEQ ID NO: 10) or a mutant 
thereof including 

\ (ii) peptides having the amino sequences 
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AGRKKRRQRRR (SEQ ID NO: 22) 
YARKARRQARR (SEQ ID NO: 23) 
YARAAARQARA (SEQ ID NO: 24) 
YARAARRAARR (SEQ ID NO: 25) 
YARAARRAARA (SEQ ID NO: 26) 
YARRRRRRRRR (SEQ ID NO: 27) 
YAAARRRRRRR (SEQ ID NO: 28); 

preferably the TAT protein consists of one of the sequences shown in (i) or 
(ii) above. 

6. The use of any one of claims 2 to 4, wherein the VP22 protein 
comprises the amino acid 16-157 of SEQ ID NO: 14. 

7. The use of any one of claims 1 to 6, wherein the site-specific DNA 
recombinase domain is selected from a recombinase protein derived from 
Cre, Flp, <()C31 recombinase, and R recombinase and preferably is Cre 
having amino acids 15 to 357 of SEQ ID NO: 2 or Flpe having amino acids 
15 to 437 of SEQ ID NO: 4. 

8. The use of any one of claims 1 to 7, wherein the protein transduction 
domain is fused to the N-terminal of the site-specific DNA recombinase 
domain. 

9. The use of any one of claims 1 to 8, wherein the protein transduction 
domain is fused to the site-specific DNA recombinase domain through a 
direct chemical bond or through a linker molecule. 

10. The use of any one of claim 9, wherein the linker molecule is a short 
peptide having 1 to 20, preferably 1 to 10 amino acid residues. 
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11. The use of any one of claims 1 to 10, wherein said fusion protein 
further comprises additional functional sequences. 

12. The use of claim 1, wherein the fusion protein has the sequence 
shown in SEQ ID NOs: 2, 4, 6 or 8. 

13. The use of any one of claims 1 to 12, wherein the living organism is a 
vertebrate, preferably a rodent or a fish. 

♦ 

14. A method for Inducing gene alterations in a living organism which 
comprises administering to said living organism, a fusion protein 
comprising a site-specific DIMA recombinase domain and a protein 
transduction domain as defined in claims 1 to 12, .wherein said living 
organism carries at least one or more recognition sites for said site- 
specific DIMA recombinase integrated in its genome. 

15. A fusion protein comprising 

(a) a site-specific DNA recombinase domain as defined in claims 2 to 9 
and 

(b) a protein transduction domain (PTD) as defined in claims 2 to 9 
provided that when (a) is the wild-type Flp or Cre then (b) is not the full 
length VP22 protein of HSV. 

16. The fusion of claim 15, wherein the (PTD) is derived from the TAT 
protein of HIV. 

17. A DNA sequence coding for the fusion protein of claim 15 or 16, said 
DNA sequence preferably comprising the sequence shown in SEQ ID 
NOs:l, 3, 5, 7, 9, 11, 13, 18 and/or 20. 

18. A vector comprising the DNA sequence of claim 17. 
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19. A host cell transformed with the vector of claim 18 and/or comprising 
the DNA of claim 17. 

20. A method for producing the fusion protein of claim 15 which comprises 
culturing the transformed host cell of claim 19 and isolating the fusion 
protein. 

21. An injectable composition comprising the fusion protein as defined in 
claims 1 to 12 or 15 to 16. 
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Fig. 5 
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Figure 6 
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Fig, 10 
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